0

P2p Gsm8k RL Env (Sarvam AI Team)

Fresh

GSM8K (grade-school math) environment with last-number verification, built on verifiers>=0.1.12. Single-turn: solve a word problem, ends-with-the-f...

Type
RL Env
Capabilities
Math
Runtime
multi-turn
License
unknown
Size
v0.1.0
Published
May 2026

Cite

Notes

Only stored in your browser.

Evals this tool implements

1

Same problem set, this tool's harness. Run it to score a model on the test.