Reward Hacking RL Env (Community)
Fresh
Reward hacking sprint environment for rubric/checklist compliance without semantic correctness.
- Type
- RL Env
- Tags
- Reward Hacking
- Runtime
single-turn- License
- unknown
- Size
- v0.1.0
- Published
- May 2026
Cite
Notes
Only stored in your browser.