Entity · model

Mem-π

modelactivemem--235cdbb1·1 events·first seen May 21, 2026

Aliases: Mem-π

Co-occurring entities

web navigation benchmark large language model agents decision-content decoupled reinforcement learning episodic memory retrieval

More like this (12)

EntityMem MemProbe oh-my-pi memory π0 MemOps π₀.₅ Phi-2 EverMemOS MA²P u-muP LamPO

Recent events (1)

6arXiv · cs.CL·May 21, 2026·source ↗

Mem-π: Adaptive Memory for LLM Agents via On-Demand Generation and Decoupled RL

Mem-π introduces a framework where a dedicated language or vision-language model generates context-specific guidance for LLM agents on demand, rather than retrieving static entries from episodic memory banks. The system is trained with a decision-content decoupled reinforcement learning objective that jointly learns when to generate guidance and what to generate, enabling abstention when generation would not help. Evaluated across web navigation, terminal-based tool use, and text-based embodied interaction benchmarks, Mem-π achieves over 30% relative improvement on web navigation tasks compared to retrieval-based and prior RL-optimized memory baselines.

Evaluation and Benchmarking Agent and Tool Ecosystem web navigation benchmark Mem-π large language model agents +3 more