setrf is a reinforcement learning (RL) environment contributor.
Trainable multi-turn Megaminx spatial reasoning environment for Prime Intellect Verifiers.
Traveling Salesman (TSP) environment for verifiers/Prime-RL.