0

Mastermind

Single-turn Mastermind environment with information-gain / elimination scoring

Domain
rl-env
License
unknown
Published
Sep 2025

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 29.1% by Qwen3 Next 80B A3B Instruct - 6 models reporting (1 frontier)

Score history

6
0%25%50%75%100%Sep 24Dec 24Mar 25Jun 25Sep 25Qwen2.5 7B InstructQwen3 Next 80B A3B Instruct

Top models

6
MastermindBar chart with 6 bars. Highest value: Qwen3 Next 80B A3B Instruct at 29.1.
6 models

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Mastermind?
Single-turn Mastermind environment with information-gain / elimination scoring
What is the current top score on Mastermind?
The top reported score is 29.1% by Qwen3 Next 80B A3B Instruct, across 6 models reporting (1 from frontier labs).
How can a model improve its Mastermind score?
Tools linked to Mastermind on Sophon include Mastermind RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
What license is Mastermind under?
Mastermind is available under unknown.