0

Humanity's Last Exam

CAIS + Scale AI benchmark of ~3,000 expert-authored questions spanning every academic subject, designed to be the hardest closed-ended exam for frontier models.

Year
2025
Venue
preprint
Authors
9
Hosting
External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2501.14249
TL;DR
Semantic Scholar
Attribution policy →