0

Arena Overall

The headline LMArena (formerly Chatbot Arena) ranking aggregating crowd pairwise preference votes across all prompt categories into a Bradley-Terry Elo-style rating.

Operator
LMArena
Kind
Human preference
Updates
live
Notable for
The most widely cited public ranking of frontier LLMs; new model releases are timed against Arena entrance.
Tracks
Preference voting (no benchmark)

Cite

Notes

Only stored in your browser.

Backing benchmark

Human-preference voting. No underlying benchmark - models are ranked by pairwise votes, not by a test you can run.