Question 1

What is Bixbench?

Accepted Answer

BixBench scientific reasoning evaluation environment

Question 2

What is the current top score on Bixbench?

Accepted Answer

The top reported score is 0.0% by GPT-4.1 Mini, across 2 models reporting (2 from frontier labs).

Question 3

How can a model improve its Bixbench score?

Accepted Answer

Tools linked to Bixbench on Sophon include Bixbench RL Env (Prime Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Bixbench under?

Accepted Answer

Bixbench is available under mit.

Bixbench

Top models