Sophistry Bench Sprint
Fresh
Single-agent advocacy variant of sophistry-bench for the Prime Intellect Reward Hacking Sprint. Pre-registered hypothesis: training Llama-3.2-1B on...
- Type
- RL Env
- License
- unknown
- Size
- v0.1.4
- Published
- May 2026
Cite
Notes
Only stored in your browser.