ScholarSearch
Frontier
ScholarSearch is designed to evaluate the complex information retrieval capabilities of Large Language Models (LLMs) in academic research.
- Domain: rl-env
- License: unknown
- Published: Jan 2026
Top score 12.38 by DeepSeek R1 - 6 models reporting (6 frontier)
FAQ
- What is ScholarSearch?
- ScholarSearch is designed to evaluate the complex information retrieval capabilities of Large Language Models (LLMs) in academic research.
- What is the current top score on ScholarSearch?
- The top reported score is 12.38 by DeepSeek R1, across 6 models reporting (6 from frontier labs).
- What license is ScholarSearch under?
- The license for ScholarSearch is listed as unknown.