0

FrontierMath

Frontier

Unpublished collection of research-level mathematics problems written by professional mathematicians, designed to be the hardest open math benchmark.

Publisher
Epoch AI
Domain
math
Format
Custom
Size
300 tasks
License
Closed
Published
Nov 2024
Updates
Monthly
Notable for
The reference frontier-math benchmark — problems are kept private to prevent contamination, and even top reasoning models scored under 10% at release.
Official leaderboard
epoch.ai/frontiermath

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
frontiermath
Attribution policy →

Top score 51.7% by GPT-5.5 - 63 models reporting (34 frontier)

Score history

52
0%25%50%75%100%Dec 23Jun 24Dec 24Jun 25Dec 25Mistral Mediumo1 Minio1o3GPT-5Gemini 3 ProGPT-5.4GPT-5.5

Top models

63
FrontierMathBar chart with 21 bars. Highest value: GPT-5.5 at 51.7.
21 models

Where it's ranked

1

Papers

2

Contributors

2

FAQ

What is FrontierMath?
Unpublished collection of research-level mathematics problems written by professional mathematicians, designed to be the hardest open math benchmark.
What capabilities does FrontierMath test?
FrontierMath evaluates math, scientific reasoning.
What is the current top score on FrontierMath?
The top reported score is 51.7% by GPT-5.5, across 63 models reporting (34 from frontier labs).
What license is FrontierMath under?
FrontierMath is available under Closed.