benchmark
MemTraceBench
benchmark · active · provisional
memtracebench-80ae7be2 · 1 event · first seen 20d ago
Aliases: MemTraceBench
Recent events (1)
MemTrace: Framework for Tracing and Attributing Errors in LLM Memory Systems
MemTrace introduces a framework that converts LLM memory pipelines into executable memory evolution graphs to enable fine-grained error tracing and root-cause attribution. The authors construct MemTraceBench, a benchmark covering Long-Context, RAG, Mem0, and EverMemOS memory systems, to systematically characterize memory failure modes such as information loss and retrieval misalignment. An automatic attribution method iteratively traces operation subgraphs to pinpoint failures, and the resulting signals are used to guide prompt optimization in a closed-loop system that improves end-task performance by up to 7.62%.
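The tracing idea in the summary above can be illustrated with a toy example. The following is a minimal sketch under stated assumptions, not the authors' implementation: the `MemoryOp` schema, the `trace_root_cause` helper, and the substring check used to decide whether a fact survived an operation are all hypothetical. It also assumes a simple chain-like graph and follows only the first untraced parent at each step.

```python
from dataclasses import dataclass, field


@dataclass
class MemoryOp:
    """One node in a toy memory evolution graph (hypothetical schema)."""
    name: str
    kind: str                                     # e.g. "write", "summarize", "retrieve"
    output: str                                   # text the operation produced
    parents: list = field(default_factory=list)   # names of upstream operations


def trace_root_cause(graph, start, fact):
    """Walk upstream from `start` and return the name of the operation
    where `fact` disappears, or None if `start` still contains it.

    An operation is blamed when its own output lacks the fact and either
    it has no parents (the fact never entered memory) or at least one
    parent still contains the fact (the fact was lost at this step).
    """
    current = graph[start]
    if fact in current.output:
        return None  # nothing to attribute
    visited = set()
    while True:
        visited.add(current.name)
        parents = [graph[p] for p in current.parents]
        if not parents or any(fact in p.output for p in parents):
            return current.name
        # No parent has the fact either, so keep tracing upstream.
        untraced = [p for p in parents if p.name not in visited]
        if not untraced:
            return current.name
        current = untraced[0]  # assumption: follow the first parent only


# Toy pipeline: the city name is dropped during summarization.
ops = [
    MemoryOp("write", "write", "Alice moved to Berlin in 2021."),
    MemoryOp("summarize", "summarize", "Alice moved recently.", ["write"]),
    MemoryOp("retrieve", "retrieve", "Alice moved recently.", ["summarize"]),
    MemoryOp("answer", "generate", "I do not know where Alice lives.", ["retrieve"]),
]
graph = {op.name: op for op in ops}

print(trace_root_cause(graph, "answer", "Berlin"))  # prints "summarize"
```

In this toy case the failure is attributed to the summarization step, which corresponds to the "information loss" failure mode named in the summary. A real attribution method would need a more robust check than substring matching and would have to handle operations with several relevant parents.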