0

Synthetic Clocks

Environment for evaluating LLMs on synthetic analog clock time reading tasks using image URLs and multiple reward criteria.

Domain
rl-env
License
unknown
Published
Oct 2025

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 10.0% by GPT-4.1 - 1 model reporting (1 frontier)

Top models

1
Synthetic ClocksBar chart with 1 bar. Highest value: GPT-4.1 at 10.
1 model

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Synthetic Clocks?
Environment for evaluating LLMs on synthetic analog clock time reading tasks using image URLs and multiple reward criteria.
What is the current top score on Synthetic Clocks?
The top reported score is 10.0% by GPT-4.1, across 1 model reporting (1 from frontier labs).
How can a model improve its Synthetic Clocks score?
Tools linked to Synthetic Clocks on Sophon include Synthetic Clocks RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.
What license is Synthetic Clocks under?
Synthetic Clocks is available under unknown.