Aider Polyglot
Fresh
Aider’s polyglot benchmark tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust.
- Type
- RL Env
- Publisher
- General Reasoning
- Capabilities
- Code Generation
- Runtime
ORS- License
- unknown
- Size
- 446 tasks
- Published
- Jan 2026
Cite
Notes
Only stored in your browser.
Public scores on this env
11 vf-eval report across 1 model