AIME2024
Fresh
Problems from the American Invitational Mathematics Examination (AIME) 2024.
- Type
- RL Env
- Publisher
- General Reasoning
- Runtime
ORS- License
- unknown
- Size
- 30 tasks
- Published
- Jan 2026
Cite
Notes
Only stored in your browser.
Public scores on this env
1317 vf-eval reports across 13 models
1GPT-5 pro (python)OpenAI1002o1OpenAI963o3OpenAI91.64Qwen 3 Coder NextAlibaba89.015R1DeepSeek79.86OpenAI-o1-091274.47Magistral MediumMistral AI73.68DeepSeek R1-ZeroDeepSeek719OpenAI-o1-mini63.610o3 MiniOpenAIdisputed6011DAPO5012GPT-4oOpenAI3813Mistral Medium 3Mistral AI26.8
Open the scoring view →