Question 1

What is Lisanbench?

Accepted Answer

Single-turn evaluation where the model is tasked to generate the longest valid chain of 1-word edits from a given starting word. The final score is the sum of the longest valid chains across all starting words.

Question 2

What is the current top score on Lisanbench?

Accepted Answer

The top reported score is 13.22 by Qwen3 4B, across 1 model reporting.

Question 3

How can a model improve its Lisanbench score?

Accepted Answer

Tools linked to Lisanbench on Sophon include Lisanbench RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Lisanbench under?

Accepted Answer

Lisanbench is available under unknown.

Lisanbench

Top models

Related tools

Lisanbench RL Env (Prime Intellect)

FAQ