Almanac
benchmark

MemTraceBench

benchmarkactiveprovisionalmemtracebench-80ae7be2·1 events·first seen 20d ago

Aliases: MemTraceBench

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.CL·20d ago·source ↗

MemTrace: Framework for Tracing and Attributing Errors in LLM Memory Systems

MemTrace introduces a framework that converts LLM memory pipelines into executable memory evolution graphs to enable fine-grained error tracing and root-cause attribution. The authors construct MemTraceBench, a benchmark covering Long-Context, RAG, Mem0, and EverMemOS memory systems, to systematically characterize memory failure modes such as information loss and retrieval misalignment. An automatic attribution method iteratively traces operation subgraphs to pinpoint failures, and the resulting signals are used to guide prompt optimization in a closed-loop system that improves end-task performance by up to 7.62%.