Question 1

What is Hle Web Py?

Accepted Answer

Humanity's Last Examination (HLE) benchmark environment for prime-environments

Question 2

What is the current top score on Hle Web Py?

Accepted Answer

The top reported score is 0.0% by GPT-4.1 Mini, across 1 model reporting (1 from frontier labs).

Question 3

How can a model improve its Hle Web Py score?

Accepted Answer

Tools linked to Hle Web Py on Sophon include WEB PY RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Hle Web Py under?

Accepted Answer

Hle Web Py is available under unknown.

Hle Web Py

Top models