Research Code Bench

Fresh

Coding challenges that evaluate LLMs' ability to translate cutting-edge ML contributions from top 2024-2025 research papers into executable code.

Type: RL Env
Capabilities: Code Generation
Tags: Machine Learning Engineering
Runtime: ORS
License: unknown
Size: 212 tasks
Published: Feb 2026
Canonical: openreward.ai/PatrickHua/research-code-bench

Attribution

README: openreward.ai/PatrickHua/research-code-bench

Contributors

1