Training Verifiers to Solve Math Word Problems

Introduces GSM8K (8.5k grade-school math word problems) and shows that training a verifier to re-rank generated solutions outperforms simply fine-tuning on the dataset.

Publisher
OpenAI
Year
2021
Venue
preprint
Authors
12
Hosting
External source (license unknown)

Introduces 1 artifact - 1 eval

TL;DR

Semantic Scholar

Verification significantly improves performance on GSM8K, and the empirical evidence strongly suggests that it scales more effectively with additional data than a fine-tuning baseline.
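
To make "verification" concrete, here is a minimal sketch of the re-ranking step: several candidate solutions are generated for a problem, a verifier assigns each a score, and the highest-scoring candidate is returned. The names `rerank_with_verifier` and `toy_verifier` are hypothetical and used only for illustration; the toy scorer is a hand-written stand-in, not the trained model from the paper.

```python
from typing import Callable, Sequence


def rerank_with_verifier(
    problem: str,
    candidates: Sequence[str],
    verifier_score: Callable[[str, str], float],
) -> str:
    """Return the candidate solution that the verifier scores highest."""
    if not candidates:
        raise ValueError("need at least one candidate solution")
    # max() keeps the first candidate when several share the top score.
    return max(candidates, key=lambda solution: verifier_score(problem, solution))


def toy_verifier(problem: str, solution: str) -> float:
    # Hypothetical stand-in for a trained verifier: it simply rewards
    # solutions ending in "12". A real verifier would be a learned model
    # that estimates how likely a solution is to be correct.
    return 1.0 if solution.strip().endswith("12") else 0.0


problem = "Ann has 3 boxes with 4 pens each. How many pens does she have?"
candidates = ["3 + 4 = 7", "3 * 4 = 12", "4 - 3 = 1"]
best = rerank_with_verifier(problem, candidates, toy_verifier)
print(best)
```

Passing the scorer in as a callable keeps selection separate from scoring, so the same function works whether the scores come from a heuristic or from a learned model.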
