Question 1

What is SWE Atlas QnA?

Accepted Answer

Codebase QnA is the first benchmark in the SWE-Atlas suite. It evaluates AI agents on deep code comprehension - tracing execution paths, explaining architectural decisions, and answering deeply technical questions about production-grade software systems.

Question 2

What is the current top score on SWE Atlas QnA?

Accepted Answer

The top reported score is 40.8 by GPT-5.4, across 2 models reporting (1 from frontier labs).

Question 3

What license is SWE Atlas QnA under?

Accepted Answer

SWE Atlas QnA is available under unknown.

SWE Atlas QnA

Top models

FAQ