0

MMLU Pro

MMLU-Pro dataset is a more robust and challenging massive multi-task understanding dataset tailored to more rigorously benchmark large language models' capabilities. This dataset contains 12K complex questions across various disciplines.

Domain
rl-env
License
unknown
Published
Jan 2026

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
OpenReward
Attribution policy →

Top score 88.5 by Qwen3.6 Plus - 4 models reporting

Score history

2
0255075100Feb 26Mar 26Apr 26Qwen3.5 397B A17BQwen3.6 Plus

Top models

4
MMLU ProBar chart with 4 bars. Highest value: Qwen3.6 Plus at 88.5.
4 models

FAQ

What is MMLU Pro?
MMLU-Pro dataset is a more robust and challenging massive multi-task understanding dataset tailored to more rigorously benchmark large language models' capabilities. This dataset contains 12K complex questions across various disciplines.
What is the current top score on MMLU Pro?
The top reported score is 88.5 by Qwen3.6 Plus, across 4 models reporting.
What license is MMLU Pro under?
MMLU Pro is available under unknown.