GPQA Diamond RL Env (Community)
Fresh
GPQA Diamond: A Graduate-Level Google-Proof Q&A Benchmark
- Type
- RL Env
- License
- unknown
- Size
- v0.1.0
- Published
- Feb 2026
Cite
Notes
Only stored in your browser.
Lift evidence
2| Eval | Tools known to lift | Source paper |
|---|---|---|
| GPQA (Full Set) | GPQA Diamond RL Env (Community) | - |
| GPQA: Graduate-Level STEM Knowledge Challenge | GPQA Diamond RL Env (Community) | - |
Evals this tool implements
1Same problem set, this tool's harness. Run it to score a model on the test.