0

SWE Bench Multilingual

Fresh

A benchmark of 300 software engineering tasks across 42 repositories and 9 programming languages: C, C++, Go, Java, JavaScript, TypeScript, PHP, Ruby, and Rust. Each instance is derived from a real GitHub pull request, following the same format and evaluation protocol as SWE-b…

Type
RL Env
Capabilities
Code Generation
Runtime
ORS
License
unknown
Size
0 tasks
Published
Jan 2026

Cite

Notes

Only stored in your browser.