Review Gaming

Fresh

Reward hacking sprint: OOD transfer probe. A security code-review audit ([REVIEW: PASS] tags) used to test whether a model trained to game the sla-...

Type: RL Env
License: unknown
Size: v0.1.0
Published: May 2026

Contributors: 1