Humanity's Last Exam
A benchmark from the Center for AI Safety (CAIS) and Scale AI of roughly 3,000 expert-authored questions spanning a broad range of academic subjects, designed to be the hardest closed-ended exam for frontier models.
- Publisher: Center for AI Safety
- Year: 2025
- Venue: preprint
- Authors: 9
- Hosting: External source, license unknown
Introduces 1 artifact - 1 eval