Entity · benchmark

IMO 2025

benchmarkactiveimo-2025-a96f25b2·2 events·first seen May 18, 2026

Aliases: IMO 2025

Co-occurring entities

MiniMax USAMO 2026 MaxProof DeepSeek V4 Gemini-3.0-Pro ICPC World Finals Hugging Face IOI 2025 DeepSeek-V3.2-Speciale GPT-5.5

More like this (12)

IOI 2025 IMO AIME 2025 ICAIS 2025 ISCA 2025 GTC 2025 HMMT 2025 EC 2025 Google I/O 2026 Algonauts 2025 AIME 2026 HMMT 2026

Recent events (2)

9arXiv · cs.CL·Jun 12, 2026·source ↗

MaxProof achieves gold-medal-level performance on IMO 2025 and USAMO 2026 via population-level test-time scaling

MiniMax introduces MaxProof, a test-time scaling framework for competition-level mathematical proof built on their MiniMax-M3 model. The system trains three capabilities — proof generation, verification, and critique-conditioned repair — then at inference time runs tournament selection over a population of candidate proofs. MaxProof scores 35/42 on IMO 2025 and 36/42 on USAMO 2026, exceeding the human gold-medal threshold on both competitions.

Frontier Model Releases Evaluation and Benchmarking MiniMax USAMO 2026 MaxProof +2 more

8Deepseek News·May 18, 2026·source ↗

DeepSeek-V3.2 and V3.2-Speciale Released: Reasoning-First Models with Agent Tool-Use Integration

DeepSeek has released two new open-weights models: DeepSeek-V3.2, the official successor to V3.2-Exp with balanced reasoning and tool-use capabilities, and DeepSeek-V3.2-Speciale, a maxed-out reasoning variant claiming gold-medal performance on IMO, CMO, ICPC World Finals, and IOI 2025. V3.2 is the first DeepSeek model to integrate chain-of-thought thinking directly into tool-use workflows, trained on a new agent data synthesis pipeline covering 1,800+ environments and 85k+ complex instructions. V3.2-Speciale is API-only with no tool-call support, available via a temporary endpoint expiring December 15, 2025, while both models are open-sourced on Hugging Face with an accompanying technical report.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Gemini-3.0-Pro ICPC World Finals +8 more