Question 1

What is FrontierScience: Expert-Level Scientific Reasoning?

Accepted Answer

Evaluates AI capabilities for expert-level scientific reasoning across physics, chemistry, and biology. Contains 160 problems with two evaluation formats: Olympic (100 samples with reference answers) and Research (60 samples with rubrics).

Question 2

What is the current top score on FrontierScience: Expert-Level Scientific Reasoning?

Accepted Answer

The top reported score is 44.8% by GPT-5.5, across 7 models reporting (1 from frontier labs).

Question 3

How can a model improve its FrontierScience: Expert-Level Scientific Reasoning score?

Accepted Answer

Tools linked to FrontierScience: Expert-Level Scientific Reasoning on Sophon include Frontierscience RL Env (Prime Intellect), Frontierscience RL Env (Wazupsteve) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is FrontierScience: Expert-Level Scientific Reasoning under?

Accepted Answer

FrontierScience: Expert-Level Scientific Reasoning is available under mit.

FrontierScience: Expert-Level Scientific Reasoning

Score history

Top models

Related tools

Frontierscience RL Env (Prime Intellect)

Frontierscience RL Env (Wazupsteve)

FAQ