Sophistry Bench Sprint

Single-agent advocacy variant of sophistry-bench for the Prime Intellect Reward Hacking Sprint. Pre-registered hypothesis: training Llama-3.2-1B on...

Type: RL Env
License: unknown
Size: v0.1.4
Published: May 2026

Contributors

1