0

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

UC Berkeley benchmark that continuously scrapes new LeetCode/AtCoder/CodeForces problems to give a contamination-free, time-stamped coding leaderboard.

Year
2024
Venue
NeurIPS
Authors
10
Hosting
External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Introduces 1 artifact - 1 eval

TL;DR

Semantic Scholar

This work proposes LiveCodeBench, a comprehensive and contamination-free evaluation of LLMs for code, which continuously collects new problems over time from contests across three competition platforms, namely LeetCode, AtCoder, and CodeForces.

Artifacts

1

Authors

10