paper
Efficient ASR Training with Conversations that Never Happened
paperactiveprovisional
efficient-asr-training-with-conversations-that-never-happened-ba226706·1 events·first seen 13d agoAliases: Efficient ASR Training with Conversations that Never Happened
Co-occurring entities
More like this (12)
Learning to Hear Hesitation: Continual Learning for Disfluency-Aware ASRLeveraging Audio-LLMs to Filter Speech-to-Speech Training DataContext-Driven Incremental Compression for Multi-Turn Dialogue GenerationContext-Driven Incremental Compression for Multi-Turn Dialogue GenerationArtificial Analysis Conversational DynamicsDetecting Knowledge Gaps from Conversational AI Interactions Using Curriculum Prerequisite GraphsReference-Augmented TrainingAttention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix ItLanguage Modeling LossRAT: Reference-Augmented Training for ASV Anti-SpoofingAdaptive Turn-Taking for Real-time Multi-Party Voice Agentscounterfactual text generation
Recent events (1)
Synthetic LLM-generated conversations improve ASR training for low-resource languages
Researchers propose a pipeline that uses LLMs to generate scenario-level dialogues and TTS to synthesize multi-speaker audio, creating simulated conversational training data for ASR systems. Evaluated on the Hungarian BEA-Dialogue benchmark, a model trained on 67 hours of real plus 636 hours of synthetic data outperforms a zero-shot model trained on 2,700 hours of real Hungarian speech. The study tests five LLM families under multiple budget and mixing configurations using a FastConformer-Large backbone, finding that generator choice and data composition significantly affect gains.