Question 1

What is LitBench?

Accepted Answer

Literary evaluation benchmark using pair-wise comparison with self-critique prompting

Question 2

What is the current top score on LitBench?

Accepted Answer

The top reported score is 80.0% by GLM 4.5 Air, across 7 models reporting (2 from frontier labs).

Question 3

How can a model improve its LitBench score?

Accepted Answer

Tools linked to LitBench on Sophon include Litbench RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is LitBench under?

Accepted Answer

LitBench is available under unknown.

LitBench

Score history