0

Text Quests

Classic Infocom interactive fiction games (Zork, Enchanter, etc.) for evaluating LLM reasoning, planning, and world modeling

Domain
rl-env
License
unknown
Published
Mar 2026

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 18.3% by Grok 4 Fast - 2 models reporting (2 frontier)

Score history

2
0%25%50%75%100%Aug 25Sep 25GPT-5 MiniGrok 4 Fast

Top models

2
Text QuestsBar chart with 2 bars. Highest value: Grok 4 Fast at 18.3.
2 models

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Text Quests?
Classic Infocom interactive fiction games (Zork, Enchanter, etc.) for evaluating LLM reasoning, planning, and world modeling
What is the current top score on Text Quests?
The top reported score is 18.3% by Grok 4 Fast, across 2 models reporting (2 from frontier labs).
How can a model improve its Text Quests score?
Tools linked to Text Quests on Sophon include TEXT Quests RL Env (Prime Community) - RL environments, datasets, and scaffolds that target this eval.
What license is Text Quests under?
Text Quests is available under unknown.