0

Backend Bench RL Env (Community)

Fresh

Environment to evaluate LLMs on the ability to generate correct and fast GPU kernels, passing tests provided by `Torch`

Type
RL Env
Runtime
multi-turn
License
unknown
Size
v0.3.14
Published
Sep 2025

Cite

Notes

Only stored in your browser.

Public scores on this env

1

2 vf-eval reports across 1 model

Open the scoring view →

Contributors

1