Goldilocks Ifeval RL Env (Community)
Fresh
FIXED: Adaptive controller for reward hacking. Monitors visible delta AND hidden reward. Adapts check count 7→9. Original was bugged (blind to hidd...
- Type
- RL Env
- Runtime
single-turn- License
- apache-2.0
- Size
- v0.1.12
- Published
- May 2026
Cite
Notes
Only stored in your browser.
Evals this tool implements
1Same problem set, this tool's harness. Run it to score a model on the test.