FrontierMath
Frontier
Unpublished collection of research-level mathematics problems written by professional mathematicians, designed to be the hardest open math benchmark.
- Publisher
- Epoch AI
- Capabilities
- MathScientific Reasoning
- Domain
- math
- Format
- Custom
- Size
- 300 tasks
- License
- Closed
- Published
- Nov 2024
- Updates
- Monthly
- Notable for
- The reference frontier-math benchmark — problems are kept private to prevent contamination, and even top reasoning models scored under 10% at release.
- Canonical
- epoch.ai/frontiermath
- Official leaderboard
- epoch.ai/frontiermath
Cite
Notes
Only stored in your browser.
Top score 51.7% by GPT-5.5 - 63 models reporting (34 frontier)
Score history
52Top models
63Where it's ranked
1Papers
2Contributors
2FAQ
- What is FrontierMath?
- Unpublished collection of research-level mathematics problems written by professional mathematicians, designed to be the hardest open math benchmark.
- What capabilities does FrontierMath test?
- FrontierMath evaluates math, scientific reasoning.
- What is the current top score on FrontierMath?
- The top reported score is 51.7% by GPT-5.5, across 63 models reporting (34 from frontier labs).
- What license is FrontierMath under?
- FrontierMath is available under Closed.

