0

ScholarSearch

Frontier

ScholarSearch is designed to evaluate the complex information retrieval capabilities of Large Language Models (LLMs) in academic research.

Domain
rl-env
License
unknown
Published
Jan 2026

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
OpenReward
Attribution policy →

Top score 12.38 by DeepSeek R1 - 6 models reporting (6 frontier)

Score history

3
03.757.511.2515Jul 24Sep 24Nov 24Jan 25Mar 25GPT-4o-miniDeepSeek R1

Top models

6
ScholarSearchBar chart with 6 bars. Highest value: GPT 4o Search Preview at 19.1.
6 models

FAQ

What is ScholarSearch?
ScholarSearch is designed to evaluate the complex information retrieval capabilities of Large Language Models (LLMs) in academic research.
What is the current top score on ScholarSearch?
The top reported score is 12.38 by DeepSeek R1, across 6 models reporting (6 from frontier labs).
What license is ScholarSearch under?
ScholarSearch is available under unknown.