Almanac
benchmark

Hallucinations Leaderboard

benchmarkactivehallucinations-leaderboard-5f0833ed·1 events·first seen 29d ago

Aliases: Hallucinations Leaderboard

Co-occurring entities

More like this (12)

Recent events (1)

5Hugging Face Blog·29d ago·source ↗

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

Hugging Face has launched an open leaderboard specifically designed to benchmark hallucination rates across large language models. The effort aims to standardize evaluation of factual accuracy and confabulation tendencies, filling a gap in existing benchmarks that focus primarily on capability rather than reliability. The leaderboard is positioned as a community-driven, transparent resource for tracking model trustworthiness.