AIME 2025: Problems from the American Invitational Mathematics Examination
Saturated
A benchmark for evaluating AI's ability to solve challenging mathematics problems from the 2025 AIME - a prestigious high school mathematics competition.
- Publisher
- Mathematical Association of America
- Domain
- Mathematics
- License
- mit
- Published
- Oct 2025
- Notable for
- Benchmark for evaluating Mathematics.
Cite
Notes
Only stored in your browser.
Top score 98.7% by GPT-5 Codex - 207 models reporting (45 frontier)
Score history
207Top models
207Related tools
5Implementations, trainers, datasets and scaffolds linked to this eval.
FAQ
- What is AIME 2025: Problems from the American Invitational Mathematics Examination?
- A benchmark for evaluating AI's ability to solve challenging mathematics problems from the 2025 AIME - a prestigious high school mathematics competition.
- What is the current top score on AIME 2025: Problems from the American Invitational Mathematics Examination?
- The top reported score is 98.7% by GPT-5 Codex, across 207 models reporting (45 from frontier labs).
- How can a model improve its AIME 2025: Problems from the American Invitational Mathematics Examination score?
- Tools linked to AIME 2025: Problems from the American Invitational Mathematics Examination on Sophon include AIME 2025 RL Env (Dev Team), AIME 2025 RL Env (Prime Intellect), AIME 2025 RL Env (Community), VF Openbench RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
- What license is AIME 2025: Problems from the American Invitational Mathematics Examination under?
- AIME 2025: Problems from the American Invitational Mathematics Examination is available under mit.



