Question 1

What is Mini Swe Agent Bench?

Accepted Answer

Mini Swe Agent Bench measures model performance on SWE Bench when the model is run in the Mini SWE Agent harness.

Question 2

What is the current top score on Mini Swe Agent Bench?

Accepted Answer

The top reported score is 93.3%, achieved by GPT-5. Three models have reported scores, all three from frontier labs.

Question 3

How can a model improve its Mini Swe Agent Bench score?

Accepted Answer

Sophon links tools that target this eval: RL environments, datasets, and scaffolds. The tools linked to Mini Swe Agent Bench include Agent Bench RL Env (Prime Community).

Question 4

What license is Mini Swe Agent Bench under?

Accepted Answer

The license for Mini Swe Agent Bench is unknown.
