Sewon Min
- Papers
- 28
Cite
Notes
Only stored in your browser.
Authored papers
28EMO: Pretraining Mixture of Experts for Emergent Modularity
arXiv 2026
Residual Context Diffusion Language Models
arXiv 2026
Learning to Detect Language Model Training Data via Active Reconstruction
arXiv 2026
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
arXiv 2026
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens
arXiv 2025
ReasonIR: Training Retrievers for Reasoning Tasks
arXiv 2025
FrontierCS: Evolving Challenges for Evolving Intelligence
arXiv 2025
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
arXiv 2025
Spurious Rewards: Rethinking Training Signals in RLVR
arXiv 2025
Constantly Improving Image Models Need Constantly Improving Benchmarks
arXiv 2025
FlexOlmo: Open Language Models for Flexible Data Use
arXiv 2025
OLMoE: Open Mixture-of-Experts Language Models
arXiv 2024
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
arXiv 2024
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens
arXiv 2024
Do Membership Inference Attacks Work on Large Language Models?
arXiv 2024
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
arXiv 2023
In-context Pretraining: Language Modeling Beyond Document Boundaries
arXiv 2023
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
arXiv 2023
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
arXiv 2023
Measuring and Narrowing the Compositionality Gap in Language Models
arXiv 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
arXiv 2022
Nonparametric Masked Language Modeling
arXiv 2022
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters
arXiv 2022
CREPE: Open-Domain Question Answering with False Presuppositions
arXiv 2022
MetaICL: Learning to Learn In Context
NAACL 2022 7
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts
NAACL 2022 7
Dense Passage Retrieval for Open-Domain Question Answering
EMNLP 2020 11
UnifiedQA: Crossing Format Boundaries With a Single QA System
Findings of the Association for Computational Linguistics 2020
Affiliations
Frequent co-authors
10from 28 papers
Hannaneh Hajishirzi
professor
Luke Zettlemoyer
professor
Pang Wei Koh
Mike Lewis
Weijia Shi
researcher
Noah A. Smith
Wen-tau Yih
Luca Soldaini
Rulin Shao
Yejin Choi
professor