John Yang

CS PhD student at Stanford; co-creator of SWE-Bench, SWE-Agent, and WebShop; started the SWE-Agent project at Princeton.

Role: grad-student
Currently at: Stanford University
Twitter: twitter.com/johnyangg
GitHub: github.com/john-b-yang
Scholar: scholar.google.com/citations
Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

9papers·2eval contribs

Authored papers

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

arXiv 2026

2026

mini-SWE-agent: A Minimal Reference Agent for SWE-bench

blog

2025

OpenThoughts: Data Recipes for Reasoning Models

arXiv 2025

2025

SWE-smith: Scaling Data for Software Engineering Agents

arXiv 2025

2025

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

ICLR

2024

EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges

arXiv 2024

2024

Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study

arXiv 2024

2024

InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

NeurIPS 2023 11

2023

WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

arXiv 2022

2022

Eval contributions

SWE-bench Verified

OpenAI

500 human-validated SWE-bench tasks confirmed solvable from the issue alone, with non-flaky test suites - the most-reported agentic coding benchmark.

ActiveCode EditingDebuggingTool CallingCode

SWE-bench

Princeton NLP Group

2,294 real GitHub issues from 12 popular Python repos that require an agent to produce a patch passing the project's test suite.

ActiveCode EditingDebuggingTool CallingCode

Affiliations

Currently at

Stanford University

grad-student · university lab

Previously

Princeton NLP Groupuniversity lab

Frequent co-authors

from 9 papers

Karthik Narasimhan

professor

4 shared papers

Ofir Press

postdoc

4 shared papers

Shunyu Yao

researcher

4 shared papers

Ludwig Schmidt

professor

3 shared papers

Alexander Wettig

researcher

2 shared papers

Binyuan Hui

2 shared papers

Carlos E. Jimenez

2 shared papers

Etash Guha

researcher

2 shared papers

Jeffrey Li

2 shared papers

Jenia Jitsev

2 shared papers