TAC TOE RL Env (Community)

Fresh

Multi-turn tic-tac-toe against a game-theory-optimal (GTO) minimax opponent. The model is randomly assigned X or O, and the first mover is also randomized. Rewards: win = 1.0, draw = 0.5, loss or illegal move = 0.0.
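
The episode structure described above can be sketched in Python as follows. This is a minimal illustration, not the environment's actual code: the 9-character string board, the function names (`minimax`, `best_move`, `play_episode`), and the `policy(board, side)` callback are all assumptions made for the example. Only the reward values and the randomized side and first mover come from the description.

```python
import random
from functools import lru_cache

# All eight winning lines on a 3x3 board indexed 0..8, row by row.
WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),
             (0, 3, 6), (1, 4, 7), (2, 5, 8),
             (0, 4, 8), (2, 4, 6)]

def winner(board):
    """Return 'X' or 'O' if that side has completed a line, else None."""
    for a, b, c in WIN_LINES:
        if board[a] != ' ' and board[a] == board[b] == board[c]:
            return board[a]
    return None

def legal_moves(board):
    return [i for i, cell in enumerate(board) if cell == ' ']

def other(player):
    return 'O' if player == 'X' else 'X'

def place(board, move, player):
    return board[:move] + player + board[move + 1:]

@lru_cache(maxsize=None)
def minimax(board, to_move):
    """Value for the side to move: +1 forced win, 0 draw, -1 forced loss."""
    w = winner(board)
    if w is not None:
        return 1 if w == to_move else -1
    moves = legal_moves(board)
    if not moves:
        return 0
    # Negamax form: my best value is the negation of the opponent's value.
    return max(-minimax(place(board, m, to_move), other(to_move)) for m in moves)

def best_move(board, player):
    """Pick a move with the highest minimax value for `player`."""
    return max(legal_moves(board),
               key=lambda m: -minimax(place(board, m, player), other(player)))

def play_episode(policy, rng):
    """Play one game and return the model's reward (1.0 / 0.5 / 0.0)."""
    model_side = rng.choice('XO')   # model is X or O at random
    to_move = rng.choice('XO')      # first mover is randomized independently
    board = ' ' * 9
    while winner(board) is None and legal_moves(board):
        if to_move == model_side:
            move = policy(board, model_side)
            if move not in legal_moves(board):
                return 0.0          # illegal move scores the same as a loss
        else:
            move = best_move(board, to_move)
        board = place(board, move, to_move)
        to_move = other(to_move)
    w = winner(board)
    if w is None:
        return 0.5                  # draw
    return 1.0 if w == model_side else 0.0
```

Under these assumptions, calling `play_episode(best_move, random.Random(0))` pits the minimax player against itself. Because a minimax opponent cannot be beaten at tic-tac-toe, the best reward a policy can reach in this sketch is the draw value of 0.5.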

Type: RL Env
Runtime: multi-turn
License: unknown
Size: v0.1.2
Published: Mar 2026

Contributors

1