Question 1

What is Curvebench Hard Env?

Accepted Answer

CurveBench-Hard: Vision-language model evaluation for hierarchical tree structure extraction from complex images

Question 2

What is the current top score on Curvebench Hard Env?

Accepted Answer

The top reported score is 22.8% by Gemini 3.1 Pro Preview, across 15 models reporting (5 from frontier labs).

Question 3

How can a model improve its Curvebench Hard Env score?

Accepted Answer

Tools linked to Curvebench Hard Env on Sophon include HARD ENV RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Curvebench Hard Env under?

Accepted Answer

Curvebench Hard Env is available under mit.

Curvebench Hard Env

Score history