Reward Hacking RL Env (Community)
Fresh
Reward Hacking Sprint v3: Enhanced environment with true metrics tracking, harder proxy traps, and hacking detection. 12 proxy rewards (4 traps) + ...
- Type
- RL Env
- Runtime
single-turn- License
- unknown
- Size
- v3.0.0
- Published
- May 2026
Cite
Notes
Only stored in your browser.
Public scores on this env
44 vf-eval reports across 4 models
1MiMo-V2.5-ProXiaomi9.482Llama 3.2 Instruct 1BMeta Platforms8.963GLM 5.1Zai5.244GPT-5 MiniOpenAI4.95
Open the scoring view →