AI Lab Tracker
miniF2F-test (Rectified)
dataset
2025-04-15
Moonshot AI
Cleaned and corrected version of the miniF2F test set, a widely used formal reasoning benchmark, released alongside Kimina-Prover.
GitHub
benchmark
reasoning
Related
kimina-prover