P2p Gsm8k RL Env (Sarvam AI Team)
GSM8K (grade-school math) environment with last-number verification, built on verifiers>=0.1.12. Single-turn: solve a word problem, ends-with-the-f...
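The "last-number verification" named above can be illustrated with a minimal sketch. This is not the environment's actual code: the function names `last_number` and `reward`, the regular expression, and the 0/1 scoring are all assumptions made for illustration. The sketch assumes the gold answer is passed as a plain numeric string.

```python
import re

# Hypothetical pattern: integers or decimals, optional sign, optional
# thousands separators (e.g. "1,234" or "-3.5").
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def last_number(text):
    """Return the last number found in text as a string, or None."""
    matches = _NUMBER.findall(text)
    if not matches:
        return None
    # Drop thousands separators so "1,234" compares equal to "1234".
    return matches[-1].replace(",", "")


def reward(completion, gold):
    """Hypothetical scoring: 1.0 if the last number matches gold, else 0.0."""
    pred = last_number(completion)
    if pred is None:
        return 0.0
    try:
        return 1.0 if float(pred) == float(gold.replace(",", "")) else 0.0
    except ValueError:
        # Gold answer was not numeric; treat as no match.
        return 0.0
```

Comparing as floats means "3.50" and "3.5" count as equal. A stricter harness might compare normalized strings instead.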
- Type: RL Env
- Publisher: Sarvam AI Team
- Capabilities: Math
- Runtime: multi-turn
- License: unknown
- Size: v0.1.0
- Published: May 2026
Evals this tool implements (1)
Same problem set, this tool's harness. Run it to score a model on the test.