Nathan Lambert

Olmo 3

arXiv 2025

RewardBench 2: Advancing Reward Model Evaluation

preprint

Spurious Rewards: Rethinking Training Signals in RLVR

arXiv 2025

2 OLMo 2 Furious

arXiv 2024

OLMo: Accelerating the Science of Language Models

arXiv 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models

CVPR 2025 1

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

arXiv 2024

OLMoE: Open Mixture-of-Experts Language Models

arXiv 2024

RewardBench: Evaluating Reward Models for Language Modeling

arXiv 2024

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

arXiv 2024

M-RewardBench: Evaluating Reward Models in Multilingual Settings

arXiv 2024

A Survey on Data Selection for Language Models

arXiv 2024

Tulu 3: Pushing Frontiers in Open Language Model Post-Training

preprint

D2PO: Discriminator-Guided DPO with Response Evaluation Models

arXiv 2024

SFT DatasetInstruction FollowingMathCode Generation

Reward Reports for Reinforcement Learning

arXiv 2022

2022

Eval contributions

RewardBench 2

Allen Institute for AI (Ai2)

2025 successor to RewardBench - harder, multi-completion (not just chosen-vs-rejected), with refreshed prompts to address contamination.

ActiveLLM JudgingSafety

RewardBench

Allen Institute for AI (Ai2)

2,985 prompt-chosen-rejected triples across chat, reasoning, safety, and code - a benchmark for evaluating reward models and LLM judges.

ActiveLLM JudgingSafety

Tool contributions

Tülu 3 SFT Mixture

Allen Institute for AI (Ai2)

Allen AI's flagship open SFT mixture combining new persona-driven prompts with curated public data for post-training a frontier-quality instruct model.

Affiliations

Currently at

Allen Institute for AI (Ai2)

researcher · non profit

Previously

Hugging Faceinfra University of California, Berkeleyuniversity lab

Frequent co-authors

from 17 papers

Hannaneh Hajishirzi

professor

10 shared papers

Noah A. Smith

9 shared papers

Jacob Morrison

research-engineer

8 shared papers

Luca Soldaini

7 shared papers

Dirk Groeneveld

Hamish Ivison

grad-student

Kyle Lo

Pete Walsh

Valentina Pyatkin

research-scientist