Medpt

Saturated

Verifiers port for MedPT dataset

Domain: rl-env
License: unknown
Published: Mar 2026

Attribution

Leaderboard scores: prime-hub

Top score: 100.0% by Grok 4.1 Fast (2 models reporting, 1 from frontier labs)

Score history

[Chart: score history from Nov 25 to Dec 25, y-axis 25% to 100%; series: Grok 4.1 Fast]

Top models

[Bar chart: Medpt scores for 2 models. Highest value: Grok 4.1 Fast at 100.]

Related tools

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Medpt?
Verifiers port for MedPT dataset
What is the current top score on Medpt?
The top reported score is 100.0% by Grok 4.1 Fast, across 2 models reporting (1 from frontier labs).
How can a model improve its Medpt score?
Tools linked to Medpt on Sophon include Medpt RL Env (Community). Linked tools are RL environments, datasets, and scaffolds that target this eval.
What license is Medpt under?
The license for Medpt is listed as unknown.