Spec RL

HumanEval-style code RL environment whose rollouts are served by the DFlash-speculated Laguna XS.2 vLLM endpoint — same reward curve, cheaper rollo...

Type: RL Env
Runtime: single-turn
License: unknown
Size: v0.1.5
Published: May 2026

Contributors: 1