AIME2024
benchmark · active · provisional
aime2024-edb7805b · 1 event · first seen 15d ago
Aliases: AIME2024
Recent events (1)
Mistral AI Releases Magistral: First Reasoning Model in Open and Enterprise Variants
Mistral AI announces Magistral, its first reasoning model, released in two variants: Magistral Small (24B parameters, open-weight, Apache 2.0) and Magistral Medium (enterprise, closed). Magistral Medium scores 73.6% on AIME2024 (90% with majority voting @64), while Magistral Small scores 70.7% (83.3% with majority voting @64). Key differentiators include native multilingual chain-of-thought reasoning across eight major languages, transparent and traceable reasoning steps, and up to 10x faster token throughput in Le Chat via Flash Answers. The release is accompanied by a research paper covering the training infrastructure, the reinforcement learning algorithm, and novel observations on training reasoning models.
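The "majority voting @64" figures are usually read as: sample 64 answers per problem, keep the most frequent one, and grade only that answer. The sketch below illustrates that reading with a toy k of 4. The function names and the sample answers are hypothetical and are not taken from Mistral's evaluation code or from the benchmark itself.

```python
from collections import Counter


def majority_vote(answers):
    """Return the most frequent answer; ties go to the answer seen first."""
    return Counter(answers).most_common(1)[0][0]


def maj_at_k_accuracy(samples_per_problem, references):
    """Fraction of problems whose majority-voted answer equals the reference."""
    correct = sum(
        majority_vote(samples) == ref
        for samples, ref in zip(samples_per_problem, references)
    )
    return correct / len(references)


# Hypothetical data: two problems, four sampled answers each (k = 4).
samples = [
    ["204", "204", "113", "204"],  # majority "204" matches the reference
    ["25", "73", "25", "25"],      # majority "25" does not match "73"
]
references = ["204", "73"]

print(maj_at_k_accuracy(samples, references))
```

Because only the voted answer is graded, a model that is right on most but not all samples for a problem still gets full credit for it, which is why the majority-voting scores sit above the single-sample ones.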