MMLU RL Env (Community)
Fresh
MMLU evaluator for multi-subject multiple-choice reasoning.
- Type
- RL Env
- Runtime
single-turn- License
- unknown
- Size
- v0.1.0
- Published
- Dec 2025
Cite
Notes
Only stored in your browser.
Lift evidence
1| Eval | Tools known to lift | Source paper |
|---|---|---|
| Measuring Massive Multitask Language Understanding | MMLU RL Env (Community) | - |
Evals this tool implements
2Same problem set, this tool's harness. Run it to score a model on the test.