Question 1

What is Clockbench?

Accepted Answer

ClockBench: multimodal clock reading and reasoning benchmark implemented for verifiers.

Question 2

What is the current top score on Clockbench?

Accepted Answer

The top reported score is 40.0% by GPT-4.1 Mini, across 5 models reporting (5 from frontier labs).

Question 3

How can a model improve its Clockbench score?

Accepted Answer

Tools linked to Clockbench on Sophon include Clockbench RL Env (Prime Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Clockbench under?

Accepted Answer

Clockbench is available under unknown.

Clockbench

Score history