Almanac
technique

Test-time Compute Search

techniqueactivetest-time-compute-search-70e142a6·1 events·first seen 28d ago

Aliases: Test-time Compute Search

Co-occurring entities

More like this (12)

Recent events (1)

6Hugging Face Blog·28d ago·source ↗

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Kimina-Prover is a new large formal reasoning model that combines reinforcement learning with test-time search to improve mathematical theorem proving. The approach applies RL-trained search strategies at inference time, targeting formal proof generation in systems like Lean. The work is published via the AI-MO (AI for Math Olympiad) team on Hugging Face, continuing the trend of applying RL and extended compute at test time to hard reasoning tasks.