Backend Bench RL Env (Community)
Fresh
Environment to evaluate LLMs on the ability to generate correct and fast GPU kernels, passing tests provided by `Torch`
- Type
- RL Env
- Runtime
multi-turn- License
- unknown
- Size
- v0.3.14
- Published
- Sep 2025
Cite
Notes
Only stored in your browser.
Public scores on this env
12 vf-eval reports across 1 model