benchmark
Hallucinations Leaderboard
benchmark · active
hallucinations-leaderboard-5f0833ed · 1 event · first seen 29d ago
Aliases: Hallucinations Leaderboard
Recent events (1)
The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models
An open leaderboard hosted on Hugging Face benchmarks how often large language models hallucinate. The effort aims to standardize evaluation of factual accuracy and confabulation, filling a gap left by existing benchmarks, which focus primarily on capability rather than reliability. The leaderboard is positioned as a community-driven, transparent resource for tracking model trustworthiness.