LiveCodeBench
Frontier
Rolling competitive-programming benchmark that scrapes LeetCode / AtCoder / Codeforces problems after a known cutoff to fight contamination.
- Publisher: University of California, Berkeley
- Capabilities: Code Generation, Debugging
- Domain: code
- Format: Custom
- Size: 1055 tasks
- License: MIT
- Published: Mar 2024
- Updates: Monthly
- Notable for: The reference contamination-free coding leaderboard. Problems are date-stamped so each model is only scored on problems released after its training cutoff.
- Canonical: livecodebench.github.io
- Official leaderboard: livecodebench.github.io/leaderboard.html
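The date-stamping idea described under "Notable for" can be illustrated with a minimal sketch. This is not LiveCodeBench's own code or data schema: the `Problem` record, its field names, and the helpers `problems_after_cutoff` and `pass_rate` are hypothetical names chosen for illustration, and the sample dates are made up.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Problem:
    # Hypothetical record; field names are illustrative, not the benchmark's schema.
    problem_id: str
    source: str  # e.g. "LeetCode", "AtCoder", "Codeforces"
    release_date: date


def problems_after_cutoff(problems, training_cutoff):
    """Keep only problems released strictly after the model's training cutoff."""
    return [p for p in problems if p.release_date > training_cutoff]


def pass_rate(results, eligible):
    """Fraction of eligible problems solved; `results` maps problem_id -> bool."""
    if not eligible:
        return None  # nothing released after the cutoff, so no score
    solved = sum(1 for p in eligible if results.get(p.problem_id, False))
    return solved / len(eligible)


# Made-up sample data for illustration only.
sample = [
    Problem("a", "LeetCode", date(2024, 1, 10)),
    Problem("b", "AtCoder", date(2024, 5, 2)),
    Problem("c", "Codeforces", date(2024, 6, 15)),
]
eligible = problems_after_cutoff(sample, training_cutoff=date(2024, 3, 31))
score = pass_rate({"a": True, "b": True, "c": False}, eligible)
```

In this sketch, problem "a" predates the assumed cutoff, so it is excluded from scoring even though the model solved it. A model with a later cutoff would be scored on a smaller, more recent subset.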
Top score: 91.7% by Gemini 3 Pro, with 279 models reporting (62 from frontier labs)
FAQ
- What is LiveCodeBench?
- Rolling competitive-programming benchmark that scrapes LeetCode / AtCoder / Codeforces problems after a known cutoff to fight contamination.
- What capabilities does LiveCodeBench test?
- LiveCodeBench evaluates code generation and debugging.
- What is the current top score on LiveCodeBench?
- The top reported score is 91.7% by Gemini 3 Pro, across 279 models reporting (62 from frontier labs).
- How can a model improve its LiveCodeBench score?
- Tools linked to LiveCodeBench on Sophon include Livecodebench RL Env (Prime Intellect) and OpenThoughts. These are RL environments, datasets, and scaffolds that target this eval.
- What license is LiveCodeBench under?
- LiveCodeBench is available under the MIT license.

