Agent Bench RL Env (Community)
Fresh
Your environment description here
- Type
- RL Env
- Runtime
multi-turn- License
- unknown
- Size
- v0.1.0
- Published
- Aug 2025
Cite
Notes
Only stored in your browser.
Public scores on this env
48 vf-eval reports across 4 models
1Qwen2.5 Coder 32B InstructAlibabadisputed100.0%2Qwen3 30B A3BAlibabadisputed84.0%3Qwen3 30B A3B Instruct 2507Alibaba58.0%4Llama 3.3 70B Instruct FP4NVIDIA38.7%
Open the scoring view →