Cite
Notes
Only stored in your browser.
Attribution
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks
arXiv 2026
SkillOrchestra: Learning to Route Agents via Skill Transfer
Reward Models Enable Scalable Code Verification by Trading Accuracy for Throughput
arXiv 2025
from 3 papers
Frederic Sala
Gabriel Orlanski
Albert Ge
Alex Gu
grad-student
Alexander Yun
Changho Shin
Devjeet Roy
Dyah Adila
Jiayu Wang
Nicholas Roberts