paper
Self-Augmenting Retrieval for Diffusion Language Models
paperactiveprovisional
self-augmenting-retrieval-for-diffusion-language-models-b1d03fa5·1 events·first seen 12d agoAliases: Self-Augmenting Retrieval for Diffusion Language Models
Co-occurring entities
More like this (12)
Diffusion Language ModelsLESS: Mutual-Stability Sampling for Diffusion Language ModelsKnowledge Editing in Masked Diffusion Language ModelsBeyond Fully Random Masking: Attention-Guided Denoising and Optimization for Diffusion Language Modelscontinuous diffusion language modeldiffusion-based generative modelsRepresentation-Conditioned Diffusion ModelsWhen Does Mixing Help? Analyzing Query Embedding Interpolation in Multilingual Dense RetrievalMasked Diffusion ModelsA Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian NoiseDenoising Diffusion Probabilistic ModelsReinforcement Learning for Language Models
Recent events (1)
SARDI: Self-Augmenting Retrieval for Diffusion Language Models using lookahead tokens
Researchers introduce SARDI, a training-free RAG framework for discrete diffusion language models that repurposes discarded low-confidence tokens during denoising as lookahead signals to guide retrieval before output is finalized. The method is retriever-agnostic and applicable to any reasoning-capable discrete diffusion LM. Evaluated across five multi-hop QA benchmarks, SARDI outperforms training-free diffusion and autoregressive retrieval baselines at up to 8x higher throughput.