g8967 is an RL env contributor.
Cite
Notes
Only stored in your browser.
Attribution
Settlers of Catan environment for RL training with LLMs
Pacman environment for multimodal RL training with verifiers
PuzzleJAX environment integration for verifiers
Snake game environment for RL training with multi-turn interaction