AA-Omniscience: A Benchmark for Long-Tail Factual Knowledge
Artificial Analysis's benchmark of 6,000 expert-written long-tail factual questions across 42 economically relevant topics, scored with a hallucination-penalized metric.
- Publisher
- Artificial Analysis
- Year
- 2025
- Venue
- preprint
- Authors
- 1
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- artificialanalysis.ai/evaluations/aa-omniscience
- TL;DR
- Semantic Scholar