Question 1

What is Mlebench?

Accepted Answer

MLE-Bench

Question 2

What is the current top score on Mlebench?

Accepted Answer

The top reported score is 100.0% by Claude 4.5 Haiku, across 3 models reporting (3 from frontier labs).

Question 3

How can a model improve its Mlebench score?

Accepted Answer

Tools linked to Mlebench on Sophon include Mlebench RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Mlebench under?

Accepted Answer

Mlebench is available under unknown.

Mlebench

Score history