5arXiv cs.CL (Computation and Language)·14h ago

RAPS-DA: Regime-aware peer specialization for robust RAG under knowledge conflicts

A new arXiv preprint introduces RAPS-DA, a training framework for making RAG systems more robust when retrieved context conflicts with a model's parametric knowledge. The approach divides conflicts into three reliability regimes (Grounding, Arbitration, Resistance) and trains separate peer specialist models per regime from a shared base, using reverse-KL supervision and a dual-layer token selector to filter uninformative training signals. Peer specialists exist only during training, so the deployed student model requires no additional components at inference time. Experiments across five conflict scenarios and two out-of-distribution benchmarks show RAPS-DA outperforms prompting, decoding, fine-tuning, RL, and single-teacher baselines.

Evaluation and Benchmarking Agent and Tool Ecosystem RAPS-DA

Related guides (2)

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Evaluation and BenchmarkingTopic guide

Evaluation and Benchmarking: How We Measure AI — and Why It Keeps Getting Harder

Read asBeginner In-depth

Related events (8)

5arXiv · cs.CL·5d ago·source ↗

Comparative framework for RAG variants: when GraphRAG and Agentic RAG are actually needed

A new arXiv preprint introduces a systematic evaluation framework comparing nine standardized RAG scenarios across regular RAG, GraphRAG, Modular RAG, and Agentic RAG on semi-structured knowledge bases. The authors propose a novel context engineering method that reduces token usage by 19–53% for GraphRAG and Agentic RAG by addressing context/memory overflow. A key finding is a 'retrieval-generation gap' where expanded retrieval does not proportionally improve generation quality, suggesting retrieval-oriented metrics overstate the benefits of advanced retrieval. The work targets practitioners building production RAG systems and provides data-driven guidance on when to use each variant.

Evaluation and Benchmarking Enterprise Deployment Patterns Agentic RAG GraphRAG Is GraphRAG Needed? From Basic RAG to Graph-/Agentic Solutions with Context Optimization +1 more

5arXiv · cs.LG·25d ago·source ↗

SARDI: Self-Augmenting Retrieval for Diffusion Language Models using lookahead tokens

Researchers introduce SARDI, a training-free RAG framework for discrete diffusion language models that repurposes discarded low-confidence tokens during denoising as lookahead signals to guide retrieval before output is finalized. The method is retriever-agnostic and applicable to any reasoning-capable discrete diffusion LM. Evaluated across five multi-hop QA benchmarks, SARDI outperforms training-free diffusion and autoregressive retrieval baselines at up to 8x higher throughput.

Evaluation and Benchmarking Agent and Tool Ecosystem Self-Augmenting Retrieval for Diffusion Language Models SARDI

5arXiv · cs.CL·21d ago·source ↗

AdvGRPO: Stable co-training framework for adaptive red teaming of language models

Researchers introduce AdvGRPO, a co-training framework that makes GRPO viable for joint attacker-defender optimization in LLM red teaming, addressing previously reported instability. The method uses dense multi-channel rewards and decoupled advantage normalization, with a curriculum progressing from single-turn to multi-turn attacks before bootstrapping co-training. Co-trained defenders outperform baselines on safety benchmarks, and the attacks show transferability across models.

AI Safety Research Alignment and RLHF AdvGRPO GRPO PPO +1 more

4arXiv · cs.CL·6d ago·source ↗

Multi-agent semantic rewriting framework for privacy-preserving RAG

A new arXiv preprint proposes a three-agent framework for sanitizing retrieved content in RAG pipelines by performing privacy extraction, semantic analysis, and reconstruction as an offline preprocessing step. Evaluated on ChatDoctor and Wiki-PII datasets across six LLMs, the approach reduces targeted information exposure in LLaMA-3-8B from 144 baseline instances to 1, while maintaining contextual fidelity (BLEU-1 of 0.122 vs. SAGE's 0.117). The framework introduces no additional online inference latency since rewriting is done offline. Source code is publicly released.

AI Safety Research Enterprise Deployment Patterns Privacy-Preserving RAG via Multi-Agent Semantic Rewriting Wiki-PII SAGE +2 more

4arXiv · cs.CL·4d ago·source ↗

Psy-CoT and RAPO: Psychology-grounded reasoning and role-aware RL for character-faithful role-playing agents

Researchers propose Psy-CoT, a chain-of-thought framework that decomposes role-playing reasoning into three psychology-grounded steps (Interaction Perception, Psychological Empathy, Logical Construction) to improve out-of-distribution generalization beyond surface mimicry. They also introduce Role-Aware Policy Optimization (RAPO), a reinforcement learning method that uses profile–token mutual information to weight gradients asymmetrically, addressing reward hacking where generic phrases receive the same signal as role-specific ones. Experiments on CoSER, CharacterBench, and CharacterEval show Psy-CoT outperforms existing role-playing CoT methods and RAPO consistently beats GRPO across model scales. The work addresses a known failure mode of SFT-based role-playing agents and proposes a targeted RL fix for reward model exploitation.

Agent and Tool Ecosystem Alignment and RLHF GRPO CharacterEval Psy-CoT +3 more

6arXiv · cs.AI·18d ago·source ↗

RA-RFT: Retrieval-Augmented Reinforcement Fine-Tuning teaches LLMs to reason by analogy

Researchers propose Retrieval-Augmented Reinforcement Fine-Tuning (RA-RFT), a post-training framework that trains a retriever to rank contexts by expected reasoning benefit rather than semantic similarity, then fine-tunes a policy model via reinforcement learning using retrieved analogous demonstrations. The key insight is that reasoning-relevant retrieval surfaces complementary solution strategies rather than superficially similar problems. On mathematical reasoning benchmarks, RA-RFT improves AIME 2025 average@32 accuracy by 7.1 and 2.8 points over GRPO for Qwen3-1.7B and Qwen3-4B respectively, suggesting reasoning-aware retrieval is orthogonal to reward design and training curriculum improvements.

Evaluation and Benchmarking Alignment and RLHF RA-RFT Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning GRPO +3 more

5arXiv · cs.CL·20d ago·source ↗

ADAS: Attention-Discounted Adaptive Sampler improves parallel decoding for masked diffusion language models

Researchers propose ADAS, a training-free reranking rule for masked diffusion language model decoding that addresses token interaction failures in parallel token commitment. The method greedily penalizes candidates that attend strongly to already-selected uncertain positions, using attention weights as soft marginal penalties rather than hard constraints. Evaluated on LLaDA-8B-Base and Dream-7B-Base across GSM8K, MATH500, HumanEval, and MBPP, ADAS improves low-NFE performance by 9–10 percentage points on average when plugged into existing samplers with only 3.1% runtime overhead.

Frontier Model Releases Inference Economics LLaDA-8B-Base MATH500 EB-Sampler +6 more

5arXiv · cs.CL·14d ago·source ↗

ASRD: Training-free anchor-guided revocable decoding for diffusion LLMs improves accuracy and throughput

A new arXiv preprint introduces ASRD (Anchor Supervised Revocable Decoding), a training-free framework for improving decoding quality in diffusion large language models. The method addresses error propagation and local error reinforcement in revocable decoding by separating trusted 'anchor tokens' (identified via temporal consistency) from uncertain candidates, then applying anchor-guided generation and anchor-perturbed verification. Experiments on math and coding benchmarks show up to 6.4% accuracy improvement and 7.2× inference throughput gains over remasking baselines.

Inference Economics ASRD Follow the Latent Roadmap: Navigating Revocable Decoding for Diffusion LLMs with Anchor Tokens