0

Med Agent Bench

Saturated

Your environment description here

Domain
rl-env
License
unknown
Published
Aug 2025

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 100.0% by Qwen2.5 Coder 32B Instruct - 4 models reporting

Score history

3
50%63%75%88%100%Nov 24Jan 25Mar 25May 25Jul 25Qwen2.5 Coder 32B Instruct

Top models

4
Med Agent BenchBar chart with 4 bars. Highest value: Qwen2.5 Coder 32B Instruct at 100.
4 models

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Med Agent Bench?
Your environment description here
What is the current top score on Med Agent Bench?
The top reported score is 100.0% by Qwen2.5 Coder 32B Instruct, across 4 models reporting.
How can a model improve its Med Agent Bench score?
Tools linked to Med Agent Bench on Sophon include Agent Bench RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
What license is Med Agent Bench under?
Med Agent Bench is available under unknown.