0

Gsm8k Multireward RL Env (Community)

Fresh

GSM8K with multi-reward support (correctness + length, optional gating)

Type
RL Env
Capabilities
Math
Runtime
single-turn
License
apache-2.0
Size
v0.2.0
Published
Jan 2026

Cite

Notes

Only stored in your browser.

Lift evidence

1
EvalTools known to liftSource paper
GSM8KGsm8k Multireward RL Env (Community)-

Contributors

1