What is the current top score on MathArena?

The top reported score is 92.8% by GPT-5.5, across 26 models reporting (10 from frontier labs).

How can a model improve its MathArena score?

Tools linked to MathArena on Sophon include APEX Shortlist RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.

Frontier

Live leaderboard of LLMs on recent math-olympiad and research-style problems, refreshed monthly to minimise pretraining contamination.

Publisher: ETH Zürich SRI Lab (Secure, Reliable & Intelligent Systems)
Domain: math
Published: May 2026
Updates: Monthly
Notable for: Continuously refreshed math benchmark — pulls fresh problems each month so frontier models can't game contamination, and reports per-problem accuracy with confidence intervals.
Canonical: matharena.ai
Official leaderboard: matharena.ai

Cite

Notes

Only stored in your browser.

Attribution

Top score 92.8% by GPT-5.5 - 26 models reporting (10 frontier)

MathArenaBar chart with 21 bars. Highest value: GPT-5.5 at 92.8.

21 models

matharena.ai

Implementations, trainers, datasets and scaffolds linked to this eval.

Prime Intellect

MathArena Apex Shortlist final-answer evaluation environment

What is MathArena?: Live leaderboard of LLMs on recent math-olympiad and research-style problems, refreshed monthly to minimise pretraining contamination.
What is the current top score on MathArena?: The top reported score is 92.8% by GPT-5.5, across 26 models reporting (10 from frontier labs).
How can a model improve its MathArena score?: Tools linked to MathArena on Sophon include APEX Shortlist RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.