Hanabi
Multi-turn cooperative card game environment where models play Hanabi by making strategic moves based on partial information.
- Domain: rl-env
- License: unknown
- Published: Oct 2025
Top score: 4.0% by o4 Mini (2 models reporting, 2 from frontier labs)
Score history (2)
Top models (2)
Related tools (1)
Implementations, trainers, datasets and scaffolds linked to this eval.
FAQ
- What is Hanabi?
- Multi-turn cooperative card game environment where models play Hanabi by making strategic moves based on partial information.
- What is the current top score on Hanabi?
- The top reported score is 4.0% by o4 Mini, across 2 models reporting (2 from frontier labs).
- How can a model improve its Hanabi score?
- Tools linked to Hanabi on Sophon include Hanabi RL Env (Community). Linked tools are RL environments, datasets, and scaffolds that target this eval.
- What license is Hanabi under?
- No license is specified for Hanabi; it is listed as unknown.