0

Gridgame

Multimodal textured grid analysis environment

Domain
rl-env
License
unknown
Published
Sep 2025

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 65.0% by GPT-4.1 Mini - 2 models reporting (2 frontier)

Score history

2
0%25%50%75%100%Apr 25Jun 25Aug 25GPT-4.1 Mini

Top models

2
GridgameBar chart with 2 bars. Highest value: GPT-4.1 Mini at 65.
2 models

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Gridgame?
Multimodal textured grid analysis environment
What is the current top score on Gridgame?
The top reported score is 65.0% by GPT-4.1 Mini, across 2 models reporting (2 from frontier labs).
How can a model improve its Gridgame score?
Tools linked to Gridgame on Sophon include Gridgame RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
What license is Gridgame under?
Gridgame is available under unknown.