SWE Atlas QnA

Fresh

Codebase QnA is the first benchmark in the SWE-Atlas suite. It evaluates AI agents on deep code comprehension - tracing execution paths, explaining architectural decisions, and answering deeply technical questions about production-grade software systems.

Type: RL Env
Publisher: General Reasoning
Tags: Code Understanding and Reasoning
Runtime: ORS
License: unknown
Size: 124 tasks
Published: Apr 2026
Canonical: openreward.ai/GeneralReasoning/SWE-Atlas-QnA

Cite

Notes

Only stored in your browser.

Attribution

README: openreward.ai/GeneralReasoning/SWE-Atlas-QnA
Scores: OpenReward

Attribution policy →

Public scores on this env

3 vf-eval reports across 2 models

1GPT-5.4OpenAI40.8 2Opus 4.6 (Claude Code)33.3

Open the scoring view →