Alphabet Sort

Toy multi-turn environment where the model must maintain and update an alphabetically sorted list of names across turns; the canonical "hello world" RL env on the Prime Intellect Hub.

Type: RL Env
Runtime: verifiers
License: MIT
Size: 1 env, procedurally generated tasks (configurable difficulty / max_turns)
Published: Jan 2025
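
The task described above can be sketched in plain Python to make the turn structure concrete. This is an illustrative sketch only, not the environment's actual code and not the verifiers API: the function names (`expected_list`, `score_turn`, `run_episode`), the case-insensitive ordering, and the position-match scoring are all assumptions made for the example.

```python
from typing import Callable, List

Policy = Callable[[List[str], List[str]], List[str]]


def expected_list(current: List[str], new_names: List[str]) -> List[str]:
    """Ground truth for one turn: merge the new names and sort alphabetically.

    Case-insensitive ordering is an assumption of this sketch.
    """
    return sorted([*current, *new_names], key=str.casefold)


def score_turn(answer: List[str], expected: List[str]) -> float:
    """Hypothetical reward: fraction of positions that match the ground truth."""
    if not expected:
        return 1.0 if not answer else 0.0
    matches = sum(a == e for a, e in zip(answer, expected))
    return matches / max(len(answer), len(expected))


def run_episode(turns: List[List[str]], policy: Policy) -> List[float]:
    """Play one multi-turn episode; each turn adds names to the running list."""
    current: List[str] = []
    scores: List[float] = []
    for new_names in turns:
        expected = expected_list(current, new_names)
        answer = policy(current, new_names)
        scores.append(score_turn(answer, expected))
        # Carry the ground truth forward so one mistake does not compound.
        current = expected
    return scores


def perfect_policy(current: List[str], new_names: List[str]) -> List[str]:
    """Stand-in for a model that always sorts correctly."""
    return expected_list(current, new_names)


def append_policy(current: List[str], new_names: List[str]) -> List[str]:
    """Stand-in for a model that appends new names without re-sorting."""
    return [*current, *new_names]


if __name__ == "__main__":
    turns = [["Mia", "alex"], ["Zoe", "Ben"]]
    print(run_episode(turns, perfect_policy))
    print(run_episode(turns, append_policy))
```

Here the number of entries in `turns` plays the role of `max_turns`, and difficulty could be varied by changing how many names arrive per turn; both mappings are guesses about how the configurable settings behave, not documented facts.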
