Paperbench ENV RL Env (Community)
Fresh
Multi-turn environment where AI agents with access to web search and sandbox execution reproduce research papers; evaluated against detailed hierar...
- Type
- RL Env
- Runtime
multi-turn- License
- unknown
- Size
- v0.1.3
- Published
- Oct 2025
Cite
Notes
Only stored in your browser.
Lift evidence
1| Eval | Tools known to lift | Source paper |
|---|---|---|
| PaperBench: Evaluating AI''s Ability to Replicate AI Research (Work In Progress) | Paperbench ENV RL Env (Community) | - |