Entity · paper

Self-Augmenting Retrieval for Diffusion Language Models

paperactiveself-augmenting-retrieval-for-diffusion-language-models-b1d03fa5·1 events·first seen Jun 5, 2026

Aliases: Self-Augmenting Retrieval for Diffusion Language Models

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.LG·Jun 5, 2026·source ↗

SARDI: Self-Augmenting Retrieval for Diffusion Language Models using lookahead tokens

Researchers introduce SARDI, a training-free RAG framework for discrete diffusion language models that repurposes discarded low-confidence tokens during denoising as lookahead signals to guide retrieval before output is finalized. The method is retriever-agnostic and applicable to any reasoning-capable discrete diffusion LM. Evaluated across five multi-hop QA benchmarks, SARDI outperforms training-free diffusion and autoregressive retrieval baselines at up to 8x higher throughput.

Evaluation and Benchmarking Agent and Tool Ecosystem Self-Augmenting Retrieval for Diffusion Language Models SARDI