Research Code Bench

Fresh

Coding challenges that evaluate LLMs' ability to translate cutting-edge ML contributions from top 2024-2025 research papers into executable code.

Type: RL Env
Capabilities: Code Generation
Runtime: ORS
License: unknown
Size: 212 tasks
Published: Feb 2026

Contributors: 1