Question 1

What is Med Agent Bench?

Accepted Answer

Med Agent Bench is a realistic virtual electronic health record (EHR) environment for benchmarking medical LLM agents on clinical tasks.

Question 2

What is the current top score on Med Agent Bench?

Accepted Answer

The top reported score is 84.0%, achieved by Qwen3 30B A3B. So far, 2 models have reported results.

Question 3

How can a model improve its Med Agent Bench score?

Accepted Answer

Tools linked to Med Agent Bench on Sophon can help: these are RL environments, datasets, and scaffolds that target this eval. They include Agent Bench RL Env (Prime Intellect).

Question 4

What license is Med Agent Bench under?

Accepted Answer

The license for Med Agent Bench is listed as unknown.
