AIME 2024: Problems from the American Invitational Mathematics Examination
Saturated
Problems from the 2024 American Invitational Mathematics Examination (AIME), an official high-school olympiad-track exam with 15 problems per sitting, used by labs as a fresh, contamination-resistant math reasoning benchmark.
- Publisher: Mathematical Association of America
- Domain: math
- Format: Custom
- Size: 30 tasks
- License: Unknown
- Published: May 2026
- Notable for: evaluating math and planning capabilities.
Top score: 96.7% by o3 (172 models reporting, 42 from frontier labs)
FAQ
- What is AIME 2024: Problems from the American Invitational Mathematics Examination?
- Problems from the 2024 American Invitational Mathematics Examination (AIME), an official high-school olympiad-track exam with 15 problems per sitting, used by labs as a fresh, contamination-resistant math reasoning benchmark.
- What capabilities does AIME 2024: Problems from the American Invitational Mathematics Examination test?
- It evaluates math and planning.
- What is the current top score on AIME 2024: Problems from the American Invitational Mathematics Examination?
- The top reported score is 96.7% by o3, across 172 models reporting (42 from frontier labs).
- How can a model improve its AIME 2024: Problems from the American Invitational Mathematics Examination score?
- Sophon links several tools that target this eval, including RL environments, datasets, and scaffolds: AIME 2024 RL Env (Prime Intellect), Hermes Example RL Env (Community), Verifiers Math (math-python), and Deepscaler RL Env (Prime Intellect).
- What license is AIME 2024: Problems from the American Invitational Mathematics Examination under?
- The license is listed as Unknown.
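
As a rough illustration of how a percentage such as the 96.7% top score relates to the 30 tasks, here is a minimal sketch of exact-match scoring. It assumes the reported score is plain accuracy over all tasks and that each answer is graded as an integer from 0 to 999 (the standard AIME answer format). The function name `aime_accuracy` and the sample data are hypothetical, not taken from any official harness.

```python
from typing import Sequence


def aime_accuracy(predictions: Sequence[int], answers: Sequence[int]) -> float:
    """Return exact-match accuracy as a percentage (hypothetical helper)."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("at least one problem is required")
    for value in answers:
        # AIME answers are integers from 0 to 999 inclusive.
        if not 0 <= value <= 999:
            raise ValueError(f"answer out of AIME range: {value}")
    correct = sum(1 for p, a in zip(predictions, answers) if p == a)
    return 100.0 * correct / len(answers)


# Illustrative data only: 30 made-up answers, with one wrong prediction.
answers = list(range(100, 130))
predictions = answers[:-1] + [0]
score = round(aime_accuracy(predictions, answers), 1)
print(score)  # 96.7, i.e. 29 of 30 correct
```

Under these assumptions, 96.7% corresponds to 29 of 30 problems answered correctly, so a single additional problem moves the score by about 3.3 points.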
