Frontierscience RL Env (Wazupsteve)
Fresh
FrontierScience benchmark for PhD-level science problems across physics, chemistry, and biology
- Type
- RL Env
- Publisher
- Wazupsteve
- Runtime
single-turn- License
- unknown
- Size
- v0.1.0
- Published
- Dec 2025
Cite
Notes
Only stored in your browser.
Evals this tool implements
1Same problem set, this tool's harness. Run it to score a model on the test.