OfficeQA
OfficeQA is a benchmark for evaluating AI agents on grounded, multi-document reasoning over a large and heterogeneous document corpus. The corpus consists of U.S. Treasury Bulletins spanning nearly 100 years, comprising 89,000 pages and over 26 million numerical values. Office…
- Domain
- rl-env
- License
- unknown
- Published
- Mar 2026
Cite
Notes
Only stored in your browser.
Top models
2FAQ
- What is OfficeQA?
- OfficeQA is a benchmark for evaluating AI agents on grounded, multi-document reasoning over a large and heterogeneous document corpus. The corpus consists of U.S. Treasury Bulletins spanning nearly 100 years, comprising 89,000 pages and over 26 million numerical values. Office…
- What license is OfficeQA under?
- OfficeQA is available under unknown.