GSM8K: Grade School Math Word Problems
Active
Measures how effectively language models solve realistic, linguistically rich math word problems at a grade-school level.
- Publisher: OpenAI
- Domain: Mathematics
- License: MIT
- Published: May 2026
- Notable for: Benchmark for evaluating mathematical reasoning in language models.
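To make the "score" on this eval concrete, here is a minimal sketch of exact-match accuracy on the final numeric answer. It relies on the convention that GSM8K reference solutions end with a line of the form `#### <answer>`. Everything else is an assumption for illustration: the helper names (`extract_final_answer`, `is_correct`, `accuracy`) are hypothetical, the sample problem is made up rather than taken from the dataset, and falling back to the last number in a model's output is only one of several extraction rules that evaluation harnesses use.

```python
import re
from decimal import Decimal
from typing import Iterable, Optional

# An optionally signed number with optional thousands separators and decimals.
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def extract_final_answer(text: str) -> Optional[str]:
    """Return the final numeric answer as a plain digit string, or None."""
    # Reference solutions mark the answer with '####'. Model outputs often
    # do not, so (as an assumption) fall back to the last number in the text.
    tail = text.split("####")[-1] if "####" in text else text
    matches = _NUMBER.findall(tail)
    if not matches:
        return None
    return matches[-1].replace(",", "")


def is_correct(prediction: str, reference: str) -> bool:
    """Compare the extracted answers numerically, so 7 and 7.0 match."""
    pred = extract_final_answer(prediction)
    ref = extract_final_answer(reference)
    if pred is None or ref is None:
        return False
    return Decimal(pred) == Decimal(ref)


def accuracy(predictions: Iterable[str], references: Iterable[str]) -> float:
    """Fraction of predictions whose final answer matches the reference."""
    pairs = list(zip(predictions, references))
    if not pairs:
        return 0.0
    return sum(is_correct(p, r) for p, r in pairs) / len(pairs)


if __name__ == "__main__":
    # Made-up example in the GSM8K answer format (not a dataset item).
    reference = "She has 3 + 4 = 7 apples.\n#### 7"
    outputs = ["3 plus 4 is 7, so the answer is 7.", "The answer is 8."]
    print(accuracy(outputs, [reference, reference]))
```

Comparing through `Decimal` rather than as strings avoids marking `7.0` wrong against `7`. A stricter harness might instead require the model to emit the `####` marker itself.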
Related tools (18)
Implementations, trainers, datasets and scaffolds linked to this eval.
Papers (1)
FAQ
- What is GSM8K: Grade School Math Word Problems?
- It measures how effectively language models solve realistic, linguistically rich math word problems at a grade-school level.
- How can a model improve its GSM8K: Grade School Math Word Problems score?
- Tools linked to GSM8K: Grade School Math Word Problems on Sophon include Gsm8k RL Env (Community), Gsm8k RL Env (Dev Team), Gsm8k RL Env (Prime Intellect), and Discover Gsm8k RL Env (Community). These are RL environments, datasets, and scaffolds that target this eval.
- What license is GSM8K: Grade School Math Word Problems under?
- GSM8K: Grade School Math Word Problems is available under the MIT license.