Entity · model

SDAR

modelactivesdar-4de1ca69·1 events·first seen Jun 2, 2026

Aliases: SDAR

Co-occurring entities

KV Cache speculative decoding Diffusion Language Models SimSD blockwise decoding

More like this (12)

ASRD SARDI ADAS DARP AdaSR SARA SECDA-DSE DEFAR SEDD SAIR APS-RAG ARPA-H

Recent events (1)

6arXiv · cs.AI·Jun 2, 2026·source ↗

SimSD: Speculative Decoding Adapted for Diffusion Language Models

SimSD introduces a training-free speculative decoding algorithm for diffusion large language models (dLLMs), which previously could not use standard token-level speculative decoding due to their bidirectional attention and masked language modeling formulation. The method uses a plug-and-play masking strategy that introduces reference tokens from a draft model and a custom attention mask, enabling valid logit computation for drafted tokens in a single forward pass. Evaluated on SDAR-family dLLMs across four benchmarks, SimSD achieves up to 7.46x decoding throughput improvement while maintaining or improving generation quality. The approach is compatible with other acceleration techniques such as KV cache and blockwise decoding.

Frontier Model Releases Inference Economics KV Cache speculative decoding SDAR +4 more