Almanac
benchmark

Open Chain of Thought Leaderboard

benchmarkactiveopen-chain-of-thought-leaderboard-86376f78·1 events·first seen 28d ago

Aliases: Open Chain of Thought Leaderboard

Co-occurring entities

More like this (12)

Recent events (1)

5Hugging Face Blog·28d ago·source ↗

Introducing the Open Chain of Thought Leaderboard

Hugging Face has launched the Open Chain of Thought Leaderboard, a benchmarking platform specifically designed to evaluate open-weight language models on chain-of-thought reasoning capabilities. The leaderboard tracks model performance across reasoning-intensive tasks that require multi-step inference. This initiative aims to provide standardized, reproducible comparisons of CoT reasoning quality across the open-weights ecosystem.