SWE-Lancer
Active
1,488 real freelance software-engineering tasks from Upwork worth $1M total in payouts, evaluating models on end-to-end paid developer work.
- Publisher
- OpenAI
- Capabilities
- Code Editing, Code Generation, Planning, Tool Calling
- Domain
- code
- Format
- Custom
- Size
- 1,488 tasks
- License
- MIT
- Published
- Feb 2025
- Notable for
- Benchmark for evaluating code editing, code generation and planning in the code domain.
FAQ
- What is SWE-Lancer?
- SWE-Lancer is a benchmark of 1,488 real freelance software-engineering tasks from Upwork, worth $1M in total payouts, that evaluates models on end-to-end paid developer work.
- What capabilities does SWE-Lancer test?
- SWE-Lancer evaluates code editing, code generation, planning, and tool calling.
- What license is SWE-Lancer under?
- SWE-Lancer is available under the MIT license.