Training Verifiers to Solve Math Word Problems
Introduces GSM8K (8.5k grade-school math word problems) and shows that training a verifier to re-rank generated solutions outperforms simply fine-tuning on the dataset.
- Publisher: OpenAI
- Year: 2021
- Venue: preprint
- Authors: 12
- Hosting: External source, license unknown
Introduces 1 artifact - 1 eval
TL;DR
Semantic Scholar
The paper demonstrates that verification significantly improves performance on GSM8K and presents strong empirical evidence that verification scales more effectively with increased data than a fine-tuning baseline.
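The re-ranking idea in the summary can be illustrated with a minimal sketch: generate several candidate solutions, score each with a verifier, and keep the highest-scoring one. The names `rerank_with_verifier`, `score_fn`, and `toy_verifier` are hypothetical, and the toy scoring rule is a stand-in for a trained verifier model, not the method used in the paper.

```python
from typing import Callable, Sequence


def rerank_with_verifier(
    problem: str,
    candidates: Sequence[str],
    score_fn: Callable[[str, str], float],
) -> str:
    """Return the candidate solution that the verifier scores highest."""
    if not candidates:
        raise ValueError("need at least one candidate solution")
    # score_fn stands in for a trained verifier (assumption for illustration).
    return max(candidates, key=lambda solution: score_fn(problem, solution))


def toy_verifier(problem: str, solution: str) -> float:
    """Hypothetical verifier: rewards solutions whose final answer is 12."""
    return 1.0 if solution.strip().endswith("= 12") else 0.0


problem = "Ann has 3 boxes with 4 apples each. How many apples does she have?"
candidates = ["3 + 4 = 7", "3 * 4 = 12", "4 - 3 = 1"]
best = rerank_with_verifier(problem, candidates, toy_verifier)
print(best)
```

In practice the candidates would be sampled from a generator model and `score_fn` would be a learned model estimating the probability that a solution is correct; both are replaced here by fixed toy values so the sketch runs on its own.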
Artifacts
1 eval