Verifiers Math (math-python)
Multi-turn math problem-solving environment where the model proposes Python code in a sandbox to compute and verify numerical answers.
- Type
- RL Env
- Publisher
- Prime Intellect
- Capabilities
- Code GenerationMathTool Calling
- Runtime
verifiers- License
- MIT
- Size
- 1 env, thousands of problems (MATH, GSM8K, AIME-style)
- Published
- Jan 2025
Cite
Notes
Only stored in your browser.
Lift evidence
2| Eval | Tools known to lift | Source paper |
|---|---|---|
| AIME 2024: Problems from the American Invitational Mathematics Examination | Verifiers Math (math-python) | - |
| GSM8K | Verifiers Math (math-python) | - |
Evals this tool implements
1Same problem set, this tool's harness. Run it to score a model on the test.
Models
Compatible
any