Question 1

What is Clockbench?

Accepted Answer

ClockBench is a multimodal benchmark for clock reading and reasoning, implemented for verifiers.

Question 2

What is the current top score on Clockbench?

Accepted Answer

The top reported score is 41.7%, achieved by GPT-4.1 Mini. Two models have reported scores, both from frontier labs.

Question 3

How can a model improve its Clockbench score?

Accepted Answer

Sophon links tools that target this eval, such as RL environments, datasets, and scaffolds. For Clockbench, the linked tools include Clockbench RL Env (Community).

Question 4

What license is Clockbench under?

Accepted Answer

The license for Clockbench is unknown.
