0

SWE-Lancer

Active

1,488 real freelance software-engineering tasks from Upwork worth $1M total in payouts, evaluating models on end-to-end paid developer work.

Publisher
OpenAI
Domain
code
Format
Custom
Size
1488 tasks
License
MIT
Published
Feb 2025
Notable for
Benchmark for evaluating code editing, code generation and planning in the code domain.

Cite

Notes

Only stored in your browser.