Danqi Chen
Princeton assistant professor and co-director of Princeton NLP; co-creator of SQuAD-era reading comprehension and open-domain QA.
- Role
- professor
- Currently at
- Princeton NLP Group
- twitter.com/danqi_chen
- Scholar
- scholar.google.com/citations
- Papers
- 40
Cite
Notes
Only stored in your browser.
Authored papers
40The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
arXiv 2025
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving
arXiv 2025
Metadata Conditioning Accelerates Language Model Pre-training
arXiv 2025
Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction
arXiv 2025
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
arXiv 2024
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
arXiv 2024
How to Train Long-Context Language Models (Effectively)
arXiv 2024
QuRating: Selecting High-Quality Data for Training Language Models
arXiv 2024
Long-Context Language Modeling with Parallel Context Encoding
arXiv 2024
Language Models as Science Tutors
arXiv 2024
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
arXiv 2024
LESS: Selecting Influential Data for Targeted Instruction Tuning
arXiv 2024
SimPO: Simple Preference Optimization with a Reference-Free Reward
arXiv 2024
LitSearch: A Retrieval Benchmark for Scientific Literature Search
arXiv 2024
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
arXiv 2024
Fine-Tuning Language Models with Just Forward Passes
fine-tuning-language-models-with-just-forward
Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation
arXiv 2023
Enabling Large Language Models to Generate Text with Citations
arXiv 2023
Adapting Language Models to Compress Contexts
arXiv 2023
What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning
arXiv 2023
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
arXiv 2023
Learning Transformer Programs
learning-transformer-programs
Evaluating Large Language Models at Evaluating Instruction Following
arXiv 2023
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
arXiv 2023
C-STS: Conditional Semantic Textual Similarity
arXiv 2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
MABEL: Attenuating Gender Bias using Textual Entailment Data
arXiv 2022
Structured Pruning Learns Compact and Accurate Models
ACL 2022 5
Should You Mask 15% in Masked Language Modeling?
arXiv 2022
A Kernel-Based View of Language Model Fine-Tuning
arXiv 2022
SimCSE: Simple Contrastive Learning of Sentence Embeddings
EMNLP 2021 11
Simple Entity-Centric Questions Challenge Dense Retrievers
EMNLP 2021 11
A Frustratingly Easy Approach for Entity and Relation Extraction
NAACL 2021 4
Dense Passage Retrieval for Open-Domain Question Answering
EMNLP 2020 11
Making Pre-trained Language Models Better Few-shot Learners
ACL 2021 5
Learning Dense Representations of Phrases at Scale
ACL 2021 5
SpanBERT: Improving Pre-training by Representing and Predicting Spans
spanbert-improving-pre-training-by-1
MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension
mrqa-2019-shared-task-evaluating-1
Reading Wikipedia to Answer Open-Domain Questions
reading-wikipedia-to-answer-open-domain-1
A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task
a-thorough-examination-of-the-cnndaily-mail-1
Affiliations
Frequent co-authors
10from 40 papers