sivit is an RL env contributor.
Cite
Notes
Only stored in your browser.
Attribution
Easy prose arithmetic with last-number verification
Easy percentage word problems with last-number verification
Easy unit-rate word problems with last-number verification
Countdown arithmetic environment with plain prompts and permissive expression parsing
GSM8K environment with a permissive last-number verifier