Cite
Notes
Only stored in your browser.
Attribution
RecoveryBench: benchmark whether coding agents recover from a wrong first attempt.
PatchRecoveryGym: evaluate whether coding agents recover from a wrong first patch.