Almanac
technique

benchmaxxing

techniqueactivebenchmaxxing-7e6cb04b·1 events·first seen 1mo ago

Aliases: benchmaxxing

Co-occurring entities

More like this (12)

Recent events (1)

4Hugging Face Blog·1mo ago·source ↗

Adding Benchmaxxer Repellant to the Open ASR Leaderboard

Hugging Face describes measures taken to prevent benchmark gaming ('benchmaxxing') on the Open ASR Leaderboard by introducing private or held-out evaluation data. The post addresses the integrity of automatic speech recognition benchmarks, where models may be overfitted or tuned specifically to public test sets. This is part of a broader effort to maintain meaningful leaderboard rankings as ASR model submissions increase.