0

Danqi Chen

Princeton assistant professor and co-director of Princeton NLP; co-creator of SQuAD-era reading comprehension and open-domain QA.

Role
professor
Papers
40

Cite

Notes

Only stored in your browser.

40papers

Authored papers

40

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

arXiv 2025

2025

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

arXiv 2025

2025

Metadata Conditioning Accelerates Language Model Pre-training

arXiv 2025

2025

Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

arXiv 2025

2025

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly

arXiv 2024

2024

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

arXiv 2024

2024

How to Train Long-Context Language Models (Effectively)

arXiv 2024

2024

QuRating: Selecting High-Quality Data for Training Language Models

arXiv 2024

2024

Long-Context Language Modeling with Parallel Context Encoding

arXiv 2024

2024

Language Models as Science Tutors

arXiv 2024

2024

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

arXiv 2024

2024

LESS: Selecting Influential Data for Targeted Instruction Tuning

arXiv 2024

2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

arXiv 2024

2024

LitSearch: A Retrieval Benchmark for Scientific Literature Search

arXiv 2024

2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

arXiv 2024

2024

Fine-Tuning Language Models with Just Forward Passes

fine-tuning-language-models-with-just-forward

2023

Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation

arXiv 2023

2023

Enabling Large Language Models to Generate Text with Citations

arXiv 2023

2023

Adapting Language Models to Compress Contexts

arXiv 2023

2023

What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning

arXiv 2023

2023

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

arXiv 2023

2023

Learning Transformer Programs

learning-transformer-programs

2023

Evaluating Large Language Models at Evaluating Instruction Following

arXiv 2023

2023

MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

arXiv 2023

2023

C-STS: Conditional Semantic Textual Similarity

arXiv 2023

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

TMLR

2022

MABEL: Attenuating Gender Bias using Textual Entailment Data

arXiv 2022

2022

Structured Pruning Learns Compact and Accurate Models

ACL 2022 5

2022

Should You Mask 15% in Masked Language Modeling?

arXiv 2022

2022

A Kernel-Based View of Language Model Fine-Tuning

arXiv 2022

2022

SimCSE: Simple Contrastive Learning of Sentence Embeddings

EMNLP 2021 11

2021

Simple Entity-Centric Questions Challenge Dense Retrievers

EMNLP 2021 11

2021

A Frustratingly Easy Approach for Entity and Relation Extraction

NAACL 2021 4

2020

Dense Passage Retrieval for Open-Domain Question Answering

EMNLP 2020 11

2020

Making Pre-trained Language Models Better Few-shot Learners

ACL 2021 5

2020

Learning Dense Representations of Phrases at Scale

ACL 2021 5

2020

SpanBERT: Improving Pre-training by Representing and Predicting Spans

spanbert-improving-pre-training-by-1

2019

MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension

mrqa-2019-shared-task-evaluating-1

2019

Reading Wikipedia to Answer Open-Domain Questions

reading-wikipedia-to-answer-open-domain-1

2017

A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task

a-thorough-examination-of-the-cnndaily-mail-1

2016

Affiliations

Currently at

Princeton NLP Group

professor · university lab

Frequent co-authors

10

from 40 papers