Blueprint-Bench 2

Spatial-reasoning benchmark over architectural blueprints (version 2).

Domain
spatial-reasoning
Published
Jun 2026
Notable for
Spatial-reasoning benchmark used at the Claude Fable 5 launch.

Attribution

Leaderboard scores
Anthropic

Top score: 38.6% by Claude Fable 5. 2 models reporting (2 from frontier labs).

Score history

[Chart: Blueprint-Bench 2 score over time. Y-axis: score, 30% to 100%. X-axis: Apr 26 to Jun 26. Series: GPT-5.5, Claude Fable 5.]

Top models

[Bar chart: Blueprint-Bench 2 scores for 2 models. Highest value: Claude Fable 5 at 38.6%.]

FAQ

What is Blueprint-Bench 2?
Blueprint-Bench 2 is a spatial-reasoning benchmark over architectural blueprints (version 2).
What is the current top score on Blueprint-Bench 2?
The top reported score is 38.6% by Claude Fable 5, across 2 models reporting (2 from frontier labs).