0

General Agent

A self-growing toolbench environment - early signs of self-improving agentic capability

Domain
rl-env
License
unknown
Published
Apr 2026

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 60.0% by GPT-5 Mini - 1 model reporting (1 frontier)

Top models

1
General AgentBar chart with 1 bar. Highest value: GPT-5 Mini at 60.
1 model

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is General Agent?
A self-growing toolbench environment - early signs of self-improving agentic capability
What is the current top score on General Agent?
The top reported score is 60.0% by GPT-5 Mini, across 1 model reporting (1 from frontier labs).
How can a model improve its General Agent score?
Tools linked to General Agent on Sophon include General Agent RL Env (Prime Intellect) - RL environments, datasets, and scaffolds that target this eval.
What license is General Agent under?
General Agent is available under unknown.