0

LiveBench - Reasoning

Frontier

Reasoning sub-leaderboard of LiveBench: theory-of-mind, zebra puzzle, spatial, and logic-with-navigation tasks.

Publisher
Abacus.AI
Published
May 2026
Canonical
livebench.ai

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
LiveBench
Attribution policy →

Top score 89.7% by Claude Opus 4.8 - 52 models reporting (30 frontier)

Score history

50
30%48%65%83%100%May 25Aug 25Nov 25Feb 26May 26Claude 4 SonnetGrok 4GPT-5.1-CodexClaude Opus 4.6Claude Opus 4.8

Top models

52
LiveBench - ReasoningBar chart with 21 bars. Highest value: Claude Opus 4.8 at 89.7.
21 models

FAQ

What is LiveBench - Reasoning?
Reasoning sub-leaderboard of LiveBench: theory-of-mind, zebra puzzle, spatial, and logic-with-navigation tasks.
What is the current top score on LiveBench - Reasoning?
The top reported score is 89.7% by Claude Opus 4.8, across 52 models reporting (30 from frontier labs).