Alexander Wettig

Princeton NLP PhD; co-author on SWE-bench, ShortLong-Doc, and the QuRating / data-selection line of pretraining-data work.

Role: researcher
Currently at: Princeton NLP Group
Twitter: twitter.com/_awettig
GitHub: github.com/awettig
Scholar: scholar.google.com/citations
Papers: 12

Attribution

Affiliations & profile: scholar.google.com/citations

Authored papers

SWE-smith: Scaling Data for Software Engineering Agents

arXiv 2025

2025

Olmo 3

arXiv 2025

2025

Metadata Conditioning Accelerates Language Model Pre-training

arXiv 2025

2025

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

ICLR

2024

OLMoE: Open Mixture-of-Experts Language Models

arXiv 2024

2024

How to Train Long-Context Language Models (Effectively)

arXiv 2024

2024

QuRating: Selecting High-Quality Data for Training Language Models

arXiv 2024

2024

Language Models as Science Tutors

arXiv 2024

2024

Adapting Language Models to Compress Contexts

arXiv 2023

2023

Learning Transformer Programs

NeurIPS

2023

Should You Mask 15% in Masked Language Modeling?

arXiv 2022

2022

A Kernel-Based View of Language Model Fine-Tuning

arXiv 2022

2022

Affiliations

Currently at

Princeton NLP Group

researcher · university lab

Frequent co-authors

from 12 papers

Danqi Chen

professor

Tianyu Gao

Akshita Bhagia

Alexis Chevalier

Ali Farhadi

CEO

Binyuan Hui

Dirk Groeneveld

Dustin Schwenk

Hannaneh Hajishirzi

professor

2 shared papers

Jacob Morrison

research-engineer

2 shared papers