Entity · technique

Fast-dLLM

techniqueactivefast-dllm-f5782055·1 events·first seen Jun 10, 2026

Aliases: Fast-dLLM

Co-occurring entities

LLaDA-8B-Base MATH500 EB-Sampler ADAS HumanEval MBPP Dream-7B-Base GSM8K

More like this (12)

Mesh LLM StreamingLLM PortLLM MDLM DLAM dLLM-Prover-7B RTLLM LiteLLM EvalLLM DiaLLM CO-LMLM long-context LLMs

Recent events (1)

5arXiv · cs.CL·Jun 10, 2026·source ↗

ADAS: Attention-Discounted Adaptive Sampler improves parallel decoding for masked diffusion language models

Researchers propose ADAS, a training-free reranking rule for masked diffusion language model decoding that addresses token interaction failures in parallel token commitment. The method greedily penalizes candidates that attend strongly to already-selected uncertain positions, using attention weights as soft marginal penalties rather than hard constraints. Evaluated on LLaDA-8B-Base and Dream-7B-Base across GSM8K, MATH500, HumanEval, and MBPP, ADAS improves low-NFE performance by 9–10 percentage points on average when plugged into existing samplers with only 3.1% runtime overhead.

Frontier Model Releases Inference Economics LLaDA-8B-Base MATH500 EB-Sampler +6 more