Research Code Bench
Fresh
Coding challenges that evaluates LLMs' ability to translate cutting-edge ML contributions from top 2024-2025 research papers into executable code.
- Type
- RL Env
- Capabilities
- Code Generation
- Runtime
ORS- License
- unknown
- Size
- 212 tasks
- Published
- Feb 2026
Cite
Notes
Only stored in your browser.