Almanac
paper

Ethical and Technical Limits of Deepfake Speech Datasets

paperactiveprovisionalethical-and-technical-limits-of-deepfake-speech-datasets-a8752504·1 events·first seen 7d ago

Aliases: Ethical and Technical Limits of Deepfake Speech Datasets

More like this (12)

Recent events (1)

5arXiv · cs.AI·7d ago·source ↗

Audit of 39 deepfake speech datasets reveals fairness and generalization gaps

A dataset-level audit of 39 deepfake speech datasets examines accessibility, documentation, demographic coverage, scale, and source corpora. The study finds that fairness assessment is largely infeasible due to missing demographic metadata, and that substantial overlap in underlying speech corpora across datasets undermines cross-dataset evaluation and inflates generalization claims. The findings challenge the credibility of robustness and fairness claims made for deepfake speech detectors.