Agent Bench RL Env (Prime Intellect)
Fresh
A realistic virtual EHR environment to benchmark medical LLM agents on clinical tasks.
- Type
- RL Env
- Publisher
- Prime Intellect
- Runtime
multi-turn- License
- unknown
- Size
- v0.1.2
- Published
- Aug 2025
Cite
Notes
Only stored in your browser.
Public scores on this env
24 vf-eval reports across 2 models