0

Alphabet Sort

Saturated

This task requires the model to maintain and update an alphabetically sorted list of names across multiple conversation turns.

Domain
rl-env
License
apache-2.0
Published
Aug 2025

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 98.2% by GPT-4.1 Mini - 1 model reporting (1 frontier)

Top models

1
Alphabet SortBar chart with 1 bar. Highest value: GPT-4.1 Mini at 98.2.
1 model

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Alphabet Sort?
This task requires the model to maintain and update an alphabetically sorted list of names across multiple conversation turns.
What is the current top score on Alphabet Sort?
The top reported score is 98.2% by GPT-4.1 Mini, across 1 model reporting (1 from frontier labs).
How can a model improve its Alphabet Sort score?
Tools linked to Alphabet Sort on Sophon include Alphabet SORT RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.
What license is Alphabet Sort under?
Alphabet Sort is available under apache-2.0.