Almanac
benchmark

AIME2024

benchmarkactiveprovisionalaime2024-edb7805b·1 events·first seen 15d ago

Aliases: AIME2024

Co-occurring entities

More like this (12)

Recent events (1)

8Mistral Ai News·15d ago·source ↗

Mistral AI Releases Magistral: First Reasoning Model in Open and Enterprise Variants

Mistral AI announces Magistral, its first reasoning model, released in two variants: Magistral Small (24B parameters, open-weight, Apache 2.0) and Magistral Medium (enterprise, closed). Magistral Medium scores 73.6% on AIME2024 (90% with majority voting @64), while Magistral Small scores 70.7% (83.3% respectively). Key differentiators include native multilingual chain-of-thought reasoning across eight major languages, transparent traceable reasoning steps, and up to 10x faster token throughput in Le Chat via Flash Answers. The release is accompanied by a research paper covering training infrastructure, reinforcement learning algorithm, and novel observations for training reasoning models.