0

SELF Reward RL Env (Community)

Fresh

Environment models self-rewarding their own responses

Type
RL Env
Runtime
single-turn
License
unknown
Size
v0.1.1
Published
Aug 2025

Cite

Notes

Only stored in your browser.

Contributors

1