0

Training Verifiers to Solve Math Word Problems

Introduces GSM8K (8.5k grade-school math word problems) and shows that training a verifier to re-rank generated solutions outperforms simply fine-tuning on the dataset.

Preview
First page of Training Verifiers to Solve Math Word Problems
Publisher
OpenAI
Year
2021
Venue
preprint
Authors
12
Hosting
External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Introduces 1 artifact - 1 eval

TL;DR

Semantic Scholar

It is demonstrated that verification significantly improves performance on GSM8K, and there is strong empirical evidence that verification scales more effectively with increased data than a finetuning baseline.

Artifacts

1

Evals

Authors

12