Agent Multistep RL Env (Community)
Fresh
A multi-turn agent environment from ACEBench that evaluates a model's ability to perform complex, sequential tool-use tasks to reach a correct fina...
- Type
- RL Env
- Runtime
multi-turn- License
- unknown
- Size
- v0.1.2
- Published
- Oct 2025
Cite
Notes
Only stored in your browser.
Public scores on this env
44 vf-eval reports across 4 models
1Qwen3 Coder 30B A3B InstructAlibaba63.3%2Qwen3 30B A3BAlibaba55.0%3Qwen3 4BAlibaba31.7%4Qwen3 4B InstructAlibaba30.0%
Open the scoring view →