Nouha Dziri
Research scientist at Allen AI; works on LLM reasoning, safety, and evaluation; lead on AI2 Faith and Fate / compositionality and the Tulu post-training line.
- Role
- researcher
- Currently at
- Allen Institute for AI (Ai2)
- twitter.com/nouhadziri
- GitHub
- github.com/nouhadziri
- Scholar
- scholar.google.com/citations
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16Emergent Social Intelligence Risks in Generative Multi-Agent Systems
arXiv 2026
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
arXiv 2025
Tulu 3: Pushing Frontiers in Open Language Model Post-Training
preprint
2 OLMo 2 Furious
arXiv 2024
RewardBench: Evaluating Reward Models for Language Modeling
arXiv 2024
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
arXiv 2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
arXiv 2024
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild
arXiv 2024
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
arXiv 2024
A Roadmap to Pluralistic Alignment
arXiv 2024
Faith and Fate: Limits of Transformers on Compositionality
faith-and-fate-limits-of-transformers-on
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
arXiv 2023
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
arXiv 2023
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos
ICCV 2023 1
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement
arXiv 2023
Affiliations
Previously
Frequent co-authors
10from 16 papers
Yejin Choi
professor
Liwei Jiang
Ximing Lu
Valentina Pyatkin
research-scientist
Bill Yuchen Lin
researcher
Faeze Brahman
researcher
Allyson Ettinger
Khyathi Chandu
Nathan Lambert
researcher
Seungju Han