Almanac
benchmark

First Proof

benchmarkactivefirst-proof-4ae2d4cd·1 events·first seen 28d ago

Aliases: First Proof

Co-occurring entities

More like this (12)

Recent events (1)

6Openai Blog·28d ago·source ↗

OpenAI Shares First Proof Math Challenge Submissions

OpenAI has published its AI model's proof attempts for the First Proof math challenge, a competition designed to test research-grade mathematical reasoning on expert-level problems. This represents a capability demonstration of OpenAI's models on formal mathematical proof generation. The submission signals continued progress in AI mathematical reasoning at a level approaching or engaging with professional research mathematics.