benchmark
Open Chain of Thought Leaderboard
benchmarkactive
open-chain-of-thought-leaderboard-86376f78·1 events·first seen 28d agoAliases: Open Chain of Thought Leaderboard
Co-occurring entities
More like this (12)
Recent events (1)
Introducing the Open Chain of Thought Leaderboard
Hugging Face has launched the Open Chain of Thought Leaderboard, a benchmarking platform specifically designed to evaluate open-weight language models on chain-of-thought reasoning capabilities. The leaderboard tracks model performance across reasoning-intensive tasks that require multi-step inference. This initiative aims to provide standardized, reproducible comparisons of CoT reasoning quality across the open-weights ecosystem.