Question 1

What is Browsecomp Plus?

Accepted Answer

Verifiers environment for BrowseComp-Plus Deep-Research Agent Benchmark. Controlled agent/retriever evaluation on the fixed human-verified corpus.

Question 2

What is the current top score on Browsecomp Plus?

Accepted Answer

The top reported score is 1.11 by GPT-4.1 Mini, across 3 models reporting (1 from frontier labs).

Question 3

How can a model improve its Browsecomp Plus score?

Accepted Answer

Tools linked to Browsecomp Plus on Sophon include Browsecomp PLUS RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Browsecomp Plus under?

Accepted Answer

Browsecomp Plus is available under apache-2.0.

Browsecomp Plus

Score history

Top models

Related tools

Browsecomp PLUS RL Env (Prime Intellect)

FAQ