Program Synthesis with Large Language Models
Google paper that introduces MBPP - 974 short crowd-sourced Python problems with unit tests - and MathQA-Python, longtime companions to HumanEval.
- Publisher
- Google Research
- Year
- 2021
- Venue
- preprint
- Authors
- 11
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.
Introduces 1 artifact - 1 eval
TL;DR
Semantic Scholar
The limits of the current generation of large language models for program synthesis in general purpose programming languages are explored, and the semantic grounding of these models is explored by fine-tuning them to predict the results of program execution.