Arena Hard Prompts

Name: Arena Hard Prompts
Creator: LMArena

LMArena subcategory ranking models on a filtered slice of Arena prompts auto-classified as hard along multiple difficulty axes.

Operator: LMArena
Kind: Human preference
Updates: live
Notable for: The user-preference view of frontier model separation on hard prompts; correlated with Arena-Hard offline benchmark.
URL: lmarena.ai/leaderboard
Tracks: Preference voting (no benchmark)

Cite

Notes

Only stored in your browser.

Attribution

Backing benchmark

Human-preference voting. No underlying benchmark - models are ranked by pairwise votes, not by a test you can run.