0

GSM8K: Grade School Math Word Problems

Active

Measures how effectively language models solve realistic, linguistically rich math word problems suitable for grade-school-level mathematics.

Publisher
OpenAI
Domain
Mathematics
License
mit
Published
May 2026
Notable for
Benchmark for evaluating Mathematics.

Cite

Notes

Only stored in your browser.

Related tools

18
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

Papers

1

FAQ

What is GSM8K: Grade School Math Word Problems?
Measures how effectively language models solve realistic, linguistically rich math word problems suitable for grade-school-level mathematics.
How can a model improve its GSM8K: Grade School Math Word Problems score?
Tools linked to GSM8K: Grade School Math Word Problems on Sophon include Gsm8k RL Env (Community), Gsm8k RL Env (Dev Team), Gsm8k RL Env (Prime Intellect), Discover Gsm8k RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
What license is GSM8K: Grade School Math Word Problems under?
GSM8K: Grade School Math Word Problems is available under mit.