Almanac
technique

DOA (Decoder-Only Attention)

technique · active · provisional · doa-decoder-only-attention--d5b98c0f · 1 event · first seen 16d ago

Aliases: DOA (Decoder-Only Attention)

Recent events (1)

6 · arXiv · cs.CL · 16d ago

DOA: Training-Free Decoder-Only Attention Policy for Long-Form Simultaneous Speech Translation with SpeechLLMs

The paper proposes Decoder-Only Attention (DOA), a training-free streaming policy for simultaneous speech-to-text translation (SimulST) that works with off-the-shelf decoder-only SpeechLLMs. Because decoder-only models have no cross-attention, DOA derives proxy alignment signals from the decoder's self-attention, which enables long-form simultaneous translation without retraining. Experiments on Phi4-Multimodal and Qwen3-Omni show low latency with translation quality approaching that of offline decoding, which the authors take as evidence that decoder self-attention carries enough alignment information to drive streaming decisions.
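
To make the idea of an attention-derived streaming decision concrete, here is a minimal Python sketch of a generic read/write rule. It is not the paper's algorithm: the summary above does not specify DOA's actual rule, so the function name `should_emit`, the parameters `recent_frames` and `threshold`, and the example weights are all hypothetical. The sketch assumes that, for one candidate output token, the self-attention weights over the audio positions have already been extracted from the model and averaged over heads and layers.

```python
from typing import Sequence


def should_emit(
    attn_to_audio: Sequence[float],
    recent_frames: int = 2,   # hypothetical: how many trailing audio frames count as "recent"
    threshold: float = 0.3,   # hypothetical: maximum attention share allowed on recent frames
) -> bool:
    """Return True to emit (WRITE) the candidate token, False to wait (READ).

    Illustrative heuristic only: if the token attends mostly to the newest
    audio frames, the audio it depends on may still be incomplete, so wait.
    """
    total = sum(attn_to_audio)
    if total <= 0.0:
        return False  # no attention mass on audio yet: wait for more input
    # Share of the audio-directed attention that falls on the newest frames.
    recent_mass = sum(attn_to_audio[-recent_frames:]) / total
    return recent_mass < threshold


# Made-up attention weights over five audio frames (oldest first).
settled = [0.40, 0.35, 0.15, 0.06, 0.04]    # mass sits on older audio
unsettled = [0.05, 0.05, 0.10, 0.30, 0.50]  # mass sits on the newest audio

print(should_emit(settled))    # True: emit the token
print(should_emit(unsettled))  # False: read more audio first
```

In a real system the threshold would trade latency against quality: a lower value waits more often and emits later, while a higher value emits earlier at greater risk of translating from incomplete audio.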