Entity · technique

episodic RAG

techniqueactiveepisodic-rag-ec7a5db9·1 events·first seen May 28, 2026

Aliases: episodic RAG

Co-occurring entities

RLVR GRPO CORE (Contrastive Reflection)MemRL GEPA

More like this (12)

RAG Agentic RAG Graph RAG GLM-RAG ComoRAG APS-RAG HippoRAG RAG Triad fastRAG VerbatimRAG episodic context GraphRAG

Recent events (1)

7arXiv · cs.AI·May 28, 2026·source ↗

CORE: Contrastive Reflection for Sample-Efficient Reasoning Improvement

CORE (Contrastive Reflection) is a non-parametric learning algorithm that improves LLM reasoning by comparing successful and unsuccessful reasoning traces to generate compact natural-language 'insights' about reasoning strategies. Across four reasoning tasks, CORE outperforms both parametric baselines (GRPO/RLVR) and non-parametric baselines (GEPA, episodic RAG, MemRL) under fixed rollout budgets, achieving comparable or better gains with as few as five training samples. The method is also more context-efficient than prompt-optimization approaches, storing learned knowledge as interpretable natural-language descriptions rather than raw traces or weight updates. The results suggest contrastive distillation of reasoning traces may be a more efficient route to self-improvement than traditional fine-tuning.

Evaluation and Benchmarking Inference Economics RLVR GRPO CORE (Contrastive Reflection)+5 more