后端: 1. Memory 管理面 API 落地(“我的记忆”增删改查 + 恢复) - 补齐 List/Get/Create/Update/Delete/Restore 的 handler、请求模型与返回视图 - 注册 `/api/v1/memory/items*` 路由并接入 MemoryHandler - 新增 memory item not found / invalid memory type / invalid memory content 三类管理面错误码 2. Memory Module / Service / Repo 扩展为“可管理 + 可治理”门面 - 新增 NewModuleWithObserve / ObserveDeps,导出 GetItem / CreateItem / UpdateItem / DeleteItem / RestoreItem / RunDedupCleanup / MemoryObserver / MemoryMetrics - 新增手动新增、修改、恢复能力;删除链路切到 SoftDeleteByID;所有管理动作统一事务内写 audit,并桥接向量同步与管理面观测 - 补齐 CreateItemFields / UpdateItemFields、单条 Create、管理侧字段更新、软删/恢复,以及 dedup 扫描/归档所需 repo 能力 - 审计操作补齐 archive / restore 3. Memory 读侧与注入侧观测补齐 - HybridRetrieve 返回 telemetry,统一记录 pinned hit / semantic hit / dedup drop / degraded / RAG fallback,并上报读取命中、去重丢弃、RAG 降级指标 - AgentService 持有 memory observer / metrics;injectMemoryContext 对读取失败、空注入、成功注入补齐结构化日志与注入计数 4. Worker / 决策 / 向量同步链路治理增强 - 召回结果显式携带 fallbackMode;hash 精确命中、rag→mysql 降级、最终动作统一写入决策观测 - 接入 vectorSyncer / observer / metrics;为 job 重试、任务成功/失败、决策分布与 fallback 补齐打点;向量 upsert/delete 统一改走公共 Syncer,并收敛 parseMemoryID 解析逻辑 5. 启动层接入 Memory 观测依赖 - 启动时创建 LoggerObserver + MetricsRegistry,并通过 NewModuleWithObserve 注入 memory 模块 前端:无 仓库:无
214 lines
5.7 KiB
Go
214 lines
5.7 KiB
Go
package vectorsync
|
|
|
|
import (
|
|
"context"
|
|
"fmt"
|
|
"log"
|
|
"strings"
|
|
|
|
infrarag "github.com/LoveLosita/smartflow/backend/infra/rag"
|
|
memoryobserve "github.com/LoveLosita/smartflow/backend/memory/observe"
|
|
memoryrepo "github.com/LoveLosita/smartflow/backend/memory/repo"
|
|
"github.com/LoveLosita/smartflow/backend/model"
|
|
)
|
|
|
|
// Syncer 负责 memory_items 与向量库之间的最小桥接。
|
|
//
|
|
// 职责边界:
|
|
// 1. 只负责“把已经落库的记忆同步到 RAG / 从 RAG 删除”;
|
|
// 2. 不负责决定哪些记忆该写、该删、该恢复,这些决策仍由上游 service/worker/cleanup 控制;
|
|
// 3. 同步失败时只回写 vector_status 并打观测,不反向回滚业务事务,避免把在线链路拖成强依赖。
|
|
type Syncer struct {
|
|
ragRuntime infrarag.Runtime
|
|
itemRepo *memoryrepo.ItemRepo
|
|
observer memoryobserve.Observer
|
|
metrics memoryobserve.MetricsRecorder
|
|
logger *log.Logger
|
|
}
|
|
|
|
func NewSyncer(
|
|
ragRuntime infrarag.Runtime,
|
|
itemRepo *memoryrepo.ItemRepo,
|
|
observer memoryobserve.Observer,
|
|
metrics memoryobserve.MetricsRecorder,
|
|
) *Syncer {
|
|
if observer == nil {
|
|
observer = memoryobserve.NewNopObserver()
|
|
}
|
|
if metrics == nil {
|
|
metrics = memoryobserve.NewNopMetrics()
|
|
}
|
|
return &Syncer{
|
|
ragRuntime: ragRuntime,
|
|
itemRepo: itemRepo,
|
|
observer: observer,
|
|
metrics: metrics,
|
|
logger: log.Default(),
|
|
}
|
|
}
|
|
|
|
// Upsert 把新增/修改/恢复后的记忆同步到向量库。
|
|
func (s *Syncer) Upsert(ctx context.Context, traceID string, items []model.MemoryItem) {
|
|
if s == nil || s.ragRuntime == nil || s.itemRepo == nil || len(items) == 0 {
|
|
return
|
|
}
|
|
|
|
requestItems := make([]infrarag.MemoryIngestItem, 0, len(items))
|
|
for _, item := range items {
|
|
requestItems = append(requestItems, infrarag.MemoryIngestItem{
|
|
MemoryID: item.ID,
|
|
UserID: item.UserID,
|
|
ConversationID: strValue(item.ConversationID),
|
|
AssistantID: strValue(item.AssistantID),
|
|
RunID: strValue(item.RunID),
|
|
MemoryType: item.MemoryType,
|
|
Title: item.Title,
|
|
Content: item.Content,
|
|
Confidence: item.Confidence,
|
|
Importance: item.Importance,
|
|
SensitivityLevel: item.SensitivityLevel,
|
|
IsExplicit: item.IsExplicit,
|
|
Status: item.Status,
|
|
TTLAt: item.TTLAt,
|
|
CreatedAt: item.CreatedAt,
|
|
})
|
|
}
|
|
|
|
result, err := s.ragRuntime.IngestMemory(memoryobserve.WithFields(ctx, map[string]any{
|
|
"trace_id": traceID,
|
|
}), infrarag.MemoryIngestRequest{
|
|
TraceID: traceID,
|
|
Action: "add",
|
|
Items: requestItems,
|
|
})
|
|
if err != nil {
|
|
s.observer.Observe(ctx, memoryobserve.Event{
|
|
Level: memoryobserve.LevelWarn,
|
|
Component: memoryobserve.ComponentWrite,
|
|
Operation: "vector_upsert",
|
|
Fields: map[string]any{
|
|
"trace_id": traceID,
|
|
"item_count": len(items),
|
|
"success": false,
|
|
"error": err,
|
|
"error_code": memoryobserve.ClassifyError(err),
|
|
},
|
|
})
|
|
for _, item := range items {
|
|
_ = s.itemRepo.UpdateVectorStateByID(ctx, item.ID, "failed", nil)
|
|
}
|
|
return
|
|
}
|
|
|
|
vectorIDMap := make(map[int64]string, len(result.DocumentIDs))
|
|
for _, documentID := range result.DocumentIDs {
|
|
memoryID := parseMemoryID(documentID)
|
|
if memoryID <= 0 {
|
|
continue
|
|
}
|
|
vectorIDMap[memoryID] = documentID
|
|
}
|
|
|
|
for _, item := range items {
|
|
vectorID := strPtrOrNil(vectorIDMap[item.ID])
|
|
_ = s.itemRepo.UpdateVectorStateByID(ctx, item.ID, "synced", vectorID)
|
|
}
|
|
s.observer.Observe(ctx, memoryobserve.Event{
|
|
Level: memoryobserve.LevelInfo,
|
|
Component: memoryobserve.ComponentWrite,
|
|
Operation: "vector_upsert",
|
|
Fields: map[string]any{
|
|
"trace_id": traceID,
|
|
"item_count": len(items),
|
|
"document_count": len(result.DocumentIDs),
|
|
"success": true,
|
|
},
|
|
})
|
|
}
|
|
|
|
// Delete 把一批记忆对应的向量从向量库中删除。
|
|
func (s *Syncer) Delete(ctx context.Context, traceID string, memoryIDs []int64) {
|
|
if s == nil || len(memoryIDs) == 0 {
|
|
return
|
|
}
|
|
if s.ragRuntime == nil || s.itemRepo == nil {
|
|
return
|
|
}
|
|
|
|
documentIDs := make([]string, 0, len(memoryIDs))
|
|
for _, id := range memoryIDs {
|
|
documentIDs = append(documentIDs, fmt.Sprintf("memory:%d", id))
|
|
}
|
|
|
|
err := s.ragRuntime.DeleteMemory(memoryobserve.WithFields(ctx, map[string]any{
|
|
"trace_id": traceID,
|
|
}), documentIDs)
|
|
if err != nil {
|
|
s.observer.Observe(ctx, memoryobserve.Event{
|
|
Level: memoryobserve.LevelWarn,
|
|
Component: memoryobserve.ComponentWrite,
|
|
Operation: "vector_delete",
|
|
Fields: map[string]any{
|
|
"trace_id": traceID,
|
|
"item_count": len(memoryIDs),
|
|
"success": false,
|
|
"error": err,
|
|
"error_code": memoryobserve.ClassifyError(err),
|
|
},
|
|
})
|
|
for _, memoryID := range memoryIDs {
|
|
_ = s.itemRepo.UpdateVectorStateByID(ctx, memoryID, "failed", nil)
|
|
}
|
|
return
|
|
}
|
|
|
|
for _, memoryID := range memoryIDs {
|
|
_ = s.itemRepo.UpdateVectorStateByID(ctx, memoryID, "deleted", nil)
|
|
}
|
|
s.observer.Observe(ctx, memoryobserve.Event{
|
|
Level: memoryobserve.LevelInfo,
|
|
Component: memoryobserve.ComponentWrite,
|
|
Operation: "vector_delete",
|
|
Fields: map[string]any{
|
|
"trace_id": traceID,
|
|
"item_count": len(memoryIDs),
|
|
"success": true,
|
|
},
|
|
})
|
|
}
|
|
|
|
func parseMemoryID(documentID string) int64 {
|
|
documentID = strings.TrimSpace(documentID)
|
|
if !strings.HasPrefix(documentID, "memory:") {
|
|
return 0
|
|
}
|
|
raw := strings.TrimPrefix(documentID, "memory:")
|
|
if strings.HasPrefix(raw, "uid:") {
|
|
return 0
|
|
}
|
|
var value int64
|
|
for _, ch := range raw {
|
|
if ch < '0' || ch > '9' {
|
|
return 0
|
|
}
|
|
value = value*10 + int64(ch-'0')
|
|
}
|
|
return value
|
|
}
|
|
|
|
func strPtrOrNil(v string) *string {
|
|
v = strings.TrimSpace(v)
|
|
if v == "" {
|
|
return nil
|
|
}
|
|
value := v
|
|
return &value
|
|
}
|
|
|
|
func strValue(v *string) string {
|
|
if v == nil {
|
|
return ""
|
|
}
|
|
return strings.TrimSpace(*v)
|
|
}
|