nwyin is an RL env contributor.
Attribution
RL environment using slop-guard as a continuous reward signal for anti-slop prose generation