Anchoring Trap
Fresh
A reward hacking sprint environment for anchoring under corrective evidence.
- Type: RL Env
- Tags: Reward Hacking
- Runtime: single-turn
- License: unknown
- Size: v0.1.1
- Published: Jun 2026