Entity · dataset

SpeechMatrix

dataset · active · speechmatrix-8cde13e6 · 1 event · first seen Jun 12, 2026

Aliases: SpeechMatrix

Co-occurring entities

Leveraging Audio-LLMs to Filter Speech-to-Speech Training Data · CVSS-C · Rank-to-Distill

More like this (12)

CapSpeech-TTS · WikiMatrix · SpeechEQ · Symphony for Speech-to-Text · Matrix-Game · Speech-to-Speech · speech-to-avatar systems · MMS (Massively Multilingual Speech) · Interleaved Speech Language Models Latently Work In Text · Voxtral TTS · SpeechT5 · WordVoice

Recent events (1)

arXiv · cs.CL · Jun 12, 2026

Audio-LLM-based data filtering for speech-to-speech translation via Rank-to-Distill

A new arXiv paper proposes using audio large language models (audio-LLMs) to filter noisy training data for end-to-end speech-to-speech translation (S2ST). The authors introduce a two-stage Rank-to-Distill strategy: first, a lightweight ranker generates pseudo-labels from noisy speech pairs; second, those pseudo-labels supervise an audio-LLM that makes keep/drop decisions directly from raw audio. Experiments on the CVSS-C and SpeechMatrix benchmarks show improvements of up to +1.4 ASR-BLEU over unfiltered baselines.
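The first stage described above (ranking noisy pairs and turning the ranking into keep/drop pseudo-labels) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the `SpeechPair` fields, the `score_fn` ranker, the `keep_fraction` cutoff, and both function names are hypothetical, and the summary does not say how the authors score pairs or where they draw the threshold. The second stage (fine-tuning an audio-LLM on these labels) is only represented here by the training records it would consume.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class SpeechPair:
    """One candidate S2ST training pair (hypothetical schema)."""
    pair_id: str
    source_audio: str  # path to the source-language audio
    target_audio: str  # path to the target-language audio


def rank_to_pseudo_labels(
    pairs: List[SpeechPair],
    score_fn: Callable[[SpeechPair], float],
    keep_fraction: float = 0.5,
) -> List[Tuple[SpeechPair, str]]:
    """Stage 1 (assumed form): rank pairs by a quality score and label
    the top `keep_fraction` as "keep" and the rest as "drop"."""
    if not 0.0 <= keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in [0, 1]")
    # Highest-scoring pairs first.
    ranked = sorted(pairs, key=score_fn, reverse=True)
    n_keep = int(len(ranked) * keep_fraction)
    return [
        (pair, "keep" if index < n_keep else "drop")
        for index, pair in enumerate(ranked)
    ]


def build_distillation_examples(
    labeled: List[Tuple[SpeechPair, str]],
) -> List[Dict[str, str]]:
    """Stage 2 input (assumed form): records pairing raw audio paths with
    the pseudo-label an audio-LLM would be trained to predict."""
    return [
        {
            "source_audio": pair.source_audio,
            "target_audio": pair.target_audio,
            "label": label,
        }
        for pair, label in labeled
    ]
```

In this sketch `score_fn` stands in for the lightweight ranker; any function that maps a pair to a comparable quality score could be plugged in. A fixed `keep_fraction` is just one simple way to turn a ranking into binary labels, chosen here for brevity.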

Evaluation and Benchmarking · Multimodal Progress · Leveraging Audio-LLMs to Filter Speech-to-Speech Training Data · SpeechMatrix · CVSS-C