Aider Polyglot

Aider’s polyglot benchmark tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust.

Type: RL Env
Capabilities: Code Generation
Runtime: ORS
License: unknown
Size: 446 tasks
Published: Jan 2026

Public scores on this env

1 vf-eval report across 1 model
