MBPP+
Extended MBPP with 35× more test cases (EvalPlus). Harder than vanilla MBPP.
- Publisher
- EvalPlus Team
- Published
- Apr 2023
- Canonical
- github.com/evalplus/evalplus
Cite
Notes
Only stored in your browser.
Top score 77.0% by Qwen2.5 Coder 32B Instruct - 13 models reporting (1 frontier)
Score history
5Top models
13FAQ
- What is MBPP+?
- Extended MBPP with 35× more test cases (EvalPlus). Harder than vanilla MBPP.
- What is the current top score on MBPP+?
- The top reported score is 77.0% by Qwen2.5 Coder 32B Instruct, across 13 models reporting (1 from frontier labs).