AI RESEARCH

Formal Conjectures: An Open and Evolving Benchmark for Verified Discovery in Mathematics

arXiv CS.AI

ArXi:2605.13171v1 Announce Type: new As automated reasoning systems advance rapidly, there is a growing need for research-level formal mathematical problems to accurately evaluate their capabilities. To address this, we present Formal Conjectures, an evolving benchmark of currently 2615 mathematical problem statements formalized in Lean 4. Sourced from areas of active mathematical research, the dataset features 1029 open research conjectures providing a zero-contamination benchmark for mathematical proof discovery, and 836 solved problems for proof autoformalization.