Medqa Followup RL Env (Community)

Fresh

Multi-turn robustness evaluation for medical LLMs - tests whether models maintain correct answers when challenged with follow-up interventions

Type: RL Env
Tags: Medical Robustness
Runtime: multi-turn
License: unknown
Size: v0.2.3
Published: Dec 2025
Canonical: app.primeintellect.ai/dashboard/environments/dynamo-ai/medqa-followup

Cite

Notes

Only stored in your browser.

Attribution

README: api.primeintellect.ai/api/v1/environmentshub/dynamo-ai/medqa-followup/@0.2.3/inspect
Scores: prime-hub

Public scores on this env

4 vf-eval reports across 2 models

Eval	Tools known to lift	Source paper
MedQA: Medical exam Q&A benchmark	Medqa Followup RL Env (Community)	-