Cite
Notes
Only stored in your browser.
Attribution
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks
arXiv 2026
Reward Models Enable Scalable Code Verification by Trading Accuracy for Throughput
arXiv 2025
Measuring The Impact Of Programming Language Distribution
arXiv 2023
from 3 papers
Aws Albarghouthi
Frederic Sala
Albert Ge
Alex Gu
grad-student
Alexander Yun
Changho Shin
Devjeet Roy
Dyah Adila
Jacob Austin
researcher
Jeffrey Hui