FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

A benchmark from Epoch AI of hundreds of original, research-level math problems written by professional mathematicians, each with an automatically verifiable answer.

Publisher
Epoch AI
Year
2024
Venue
preprint
Authors
10
Hosting
External source; license unknown

Attribution

Abstract & full text
arxiv.org/abs/2411.04872
TL;DR
Semantic Scholar