LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
UC Berkeley benchmark that continuously scrapes new LeetCode/AtCoder/CodeForces problems to give a contamination-free, time-stamped coding leaderboard.
- Publisher
- University of California, Berkeley
- Year
- 2024
- Venue
- NeurIPS
- Authors
- 10
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.
Introduces 1 artifact - 1 eval
TL;DR
Semantic Scholar
This work proposes LiveCodeBench, a comprehensive and contamination-free evaluation of LLMs for code, which continuously collects new problems over time from contests across three competition platforms, namely LeetCode, AtCoder, and CodeForces.
Artifacts
1Evals