Lisanbench RL Env (Prime Intellect)
Single-turn evaluation in which the model must generate the longest valid chain of words from a given starting word, where each step is a single edit to the previous word. The final score is the sum of the lengths of the longest valid chains across all starting words.
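The scoring described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the environment's actual implementation: it assumes a "single edit" means one letter inserted, deleted, or substituted, that words may not repeat, that validity is checked against a supplied word list, and that a chain is scored by its valid prefix. The names `is_one_edit`, `valid_chain_length`, and `total_score` are hypothetical.

```python
def is_one_edit(a: str, b: str) -> bool:
    """Assumed link rule: b differs from a by exactly one letter edit."""
    if a == b or abs(len(a) - len(b)) > 1:
        return False
    if len(a) == len(b):
        # Same length: exactly one substituted character.
        return sum(x != y for x, y in zip(a, b)) == 1
    shorter, longer = sorted((a, b), key=len)
    # Lengths differ by one: deleting one character from the longer
    # word must yield the shorter word.
    return any(longer[:i] + longer[i + 1:] == shorter for i in range(len(longer)))


def valid_chain_length(start: str, chain: list[str], vocabulary: set[str]) -> int:
    """Count chain words accepted before the first invalid step (assumed rule)."""
    seen = {start}
    prev = start
    length = 0
    for word in chain:
        if word in seen or word not in vocabulary or not is_one_edit(prev, word):
            break  # repeated word, unknown word, or more than one edit
        seen.add(word)
        prev = word
        length += 1
    return length


def total_score(responses: dict[str, list[str]], vocabulary: set[str]) -> int:
    """Sum the per-start-word chain lengths, as the description states."""
    return sum(valid_chain_length(s, c, vocabulary) for s, c in responses.items())


# Hypothetical toy word list and model outputs, for illustration only.
vocabulary = {"cat", "cot", "cog", "dog", "cart", "at"}
responses = {"cat": ["cot", "cog", "dog"], "dog": ["cog", "xyz"]}
score = total_score(responses, vocabulary)
```

Here the chain from "cat" contributes three valid steps and the chain from "dog" contributes one, since "xyz" is not in the toy word list. The real environment may define validity or chain length differently.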
- Type: RL Env
- Publisher: Prime Intellect
- Tags: Word Game
- Runtime: single-turn
- License: unknown
- Size: v0.1.2
- Published: Sep 2025
Public scores on this env
11 vf-eval reports across 1 model