Question 1

What is Hanabi?

Accepted Answer

Hanabi is a multi-turn cooperative card game environment in which models play Hanabi by making strategic moves based on partial information.
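To make "partial information" concrete: in the card game Hanabi, each player can see the other players' cards but not their own, and learns about their own hand only through hints. The following is a minimal Python sketch of that observation rule. The names Card, observation, and hint_positions, along with the sample hands, are illustrative assumptions and are not taken from Sophon or from the Hanabi RL Env tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Card:
    color: str  # e.g. "R", "G", "B", "Y", "W"
    rank: int   # 1 through 5


def observation(hands, player):
    """Return what `player` can see: every hand except their own (hypothetical helper)."""
    view = {}
    for name, cards in hands.items():
        if name == player:
            # A player's own cards are hidden from them.
            view[name] = ["?"] * len(cards)
        else:
            view[name] = [f"{c.color}{c.rank}" for c in cards]
    return view


def hint_positions(hand, color=None, rank=None):
    """Return the positions in `hand` that match a color hint or a rank hint."""
    return [
        i for i, c in enumerate(hand)
        if (color is not None and c.color == color)
        or (rank is not None and c.rank == rank)
    ]


# Sample hands, invented for illustration only.
hands = {
    "alice": [Card("R", 1), Card("G", 3), Card("R", 4)],
    "bob": [Card("B", 2), Card("Y", 5), Card("B", 1)],
}

alice_view = observation(hands, "alice")
red_hint_for_alice = hint_positions(hands["alice"], color="R")
```

Here alice_view hides Alice's own three cards while showing Bob's, and red_hint_for_alice lists the positions Bob would point to if he told Alice which of her cards are red.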

Question 2

What is the current top score on Hanabi?

Accepted Answer

The top reported score is 4.0%, achieved by o4 Mini. Two models have reported scores, both from frontier labs.

Question 3

How can a model improve its Hanabi score?

Accepted Answer

Tools linked to Hanabi on Sophon include Hanabi RL Env (Community). Linked tools are RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Hanabi under?

Accepted Answer

The license for Hanabi is unknown.
