0

Code Qa

Progressive code QA environment - fix bugs, parsing edge cases, code cleanup across 5 difficulty levels

Domain
rl-env
License
unknown
Published
Apr 2026

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 0.0% by Olmo 3 7B Instruct - 2 models reporting (1 frontier)

Score history

2
0%25%50%75%100%Nov 25Dec 25Olmo 3 7B Instruct

Top models

2
Code QaBar chart with 2 bars. Highest value: DeepSeek V3.2 at 0.
2 models

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Code Qa?
Progressive code QA environment - fix bugs, parsing edge cases, code cleanup across 5 difficulty levels
What is the current top score on Code Qa?
The top reported score is 0.0% by Olmo 3 7B Instruct, across 2 models reporting (1 from frontier labs).
How can a model improve its Code Qa score?
Tools linked to Code Qa on Sophon include CODE QA RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
What license is Code Qa under?
Code Qa is available under unknown.