Medconceptsqa RL Env (Medarc)


Type: RL Env
Publisher: Medarc
Runtime: single-turn
License: unknown
Version: v0.1.1
Published: Dec 2025
Canonical: app.primeintellect.ai/dashboard/environments/medarc/medconceptsqa


Attribution

README: api.primeintellect.ai/api/v1/environmentshub/medarc/medconceptsqa/@0.1.1/inspect


medconceptsqa

Overview

Environment ID: medconceptsqa
Short description: MedConcepts QA - an MCQ dataset involving medical codes.
Tags: medical, clinical, single-turn, multiple-choice, classification, test

Datasets

Primary dataset(s): medconceptsqa
Source links: Paper, Github, HF Dataset
Split sizes: 60 (dev / few-shot), 820k (test)

Task

Type: single-turn
Rubric overview: Binary scoring based on correct answer choice
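
As a rough illustration of binary scoring on an answer choice, the sketch below extracts a letter from a model completion and returns 1.0 on a match and 0.0 otherwise. The function names, the A-D option letters, and the extraction patterns are assumptions made for this example; the environment's actual parser and rubric may differ.

```python
import re
from typing import Optional

# Hypothetical pattern: "answer", optionally "is", then an uppercase letter A-D
# that is not the start of a longer word.
ANSWER_RE = re.compile(r"(?i:answer\s*(?:is)?)\s*[:\-]?\s*\(?([A-D])(?![A-Za-z])")


def extract_choice(completion: str) -> Optional[str]:
    """Return the chosen option letter, or None if no choice can be found."""
    text = completion.strip()
    # Case 1: the completion is just the letter, e.g. "B", "(c)" or "D."
    if re.fullmatch(r"\(?[A-Da-d]\)?\.?", text):
        return text.strip("().").upper()
    # Case 2: an explicit "Answer: X" / "the answer is X" statement; keep the last one.
    found = ANSWER_RE.findall(text)
    return found[-1] if found else None


def binary_reward(completion: str, target: str) -> float:
    """Score 1.0 when the extracted choice equals the target letter, else 0.0."""
    return 1.0 if extract_choice(completion) == target.strip().upper() else 0.0
```

A completion with no recognizable choice scores 0.0 under this sketch, the same as a wrong answer.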

Quickstart

Run an evaluation with default settings:

prime eval run medconceptsqa -m "openai/gpt-5-mini" -n 5 -s

Configure model and sampling:

medarc-eval medconceptsqa -m "openai/gpt-5-mini" -n 20 --num-few-shot 4

Notes:

Pass environment arguments to medarc-eval directly as flags (for example, --num-few-shot 4).

Environment Arguments

Arg	Type	Default	Description
`num_few_shot`	int	`0`	Number of few-shot examples to include in the prompt
`use_think`	bool	`False`	Whether to use `<think>...</think>` formatting with `ThinkParser`
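
To make the two arguments concrete, here is a minimal sketch of how `num_few_shot` and `use_think` could shape the prompt and the parsed completion. The record layout (`question`, `A` to `D`, `answer`), the prompt wording, and the helper names are hypothetical; the environment's real templates and `ThinkParser` behaviour are not shown in this README and may differ.

```python
import re
from typing import Dict, List


def format_question(example: Dict[str, str], with_answer: bool) -> str:
    """Render one multiple-choice item; solved few-shot items include the answer."""
    lines = [example["question"]]
    for letter in "ABCD":
        lines.append(f"{letter}. {example[letter]}")
    lines.append(f"Answer: {example['answer']}" if with_answer else "Answer:")
    return "\n".join(lines)


def build_prompt(dev_examples: List[Dict[str, str]],
                 target: Dict[str, str],
                 num_few_shot: int = 0) -> str:
    """Prepend the first `num_few_shot` solved dev examples to the target question."""
    shots = [format_question(ex, with_answer=True) for ex in dev_examples[:num_few_shot]]
    return "\n\n".join(shots + [format_question(target, with_answer=False)])


def strip_think(completion: str) -> str:
    """Drop <think>...</think> blocks so only the final answer text remains."""
    return re.sub(r"<think>.*?</think>", "", completion, flags=re.DOTALL).strip()
```

With `num_few_shot=0` the prompt contains only the target question, which matches the default in the table above.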

Metrics

Metric	Meaning
`accuracy`	Exact match on target answer
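
Since each example is scored 0 or 1, the reported `accuracy` can be read as the fraction of examples whose predicted choice matches the target. The helper below is an illustrative sketch, not the environment's own aggregation code; the name `accuracy` and the case-insensitive comparison are assumptions.

```python
from typing import Sequence


def accuracy(predictions: Sequence[str], targets: Sequence[str]) -> float:
    """Fraction of predictions that exactly match their target letter."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        return 0.0
    # Compare letters after trimming whitespace and normalizing case.
    correct = sum(
        p.strip().upper() == t.strip().upper()
        for p, t in zip(predictions, targets)
    )
    return correct / len(targets)
```

For instance, three matches out of four examples give an accuracy of 0.75.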

Authors

This environment has been put together by:

Anish Mahishi - (@macandro96)