Nouha Dziri

2 OLMo 2 Furious

arXiv 2024

RewardBench: Evaluating Reward Models for Language Modeling

arXiv 2024

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

arXiv 2024

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

arXiv 2024

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

arXiv 2024

A Roadmap to Pluralistic Alignment

arXiv 2024

AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text

arXiv 2024

Faith and Fate: Limits of Transformers on Compositionality


CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos

ICCV 2023

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

arXiv 2023

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties

arXiv 2023

Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement

arXiv 2023

Affiliations

Currently at

Allen Institute for AI (Ai2)

researcher · non profit

Previously

Mila - Quebec AI Institute · research group

Frequent co-authors

from 16 papers

Yejin Choi

professor

11 shared papers

Liwei Jiang

8 shared papers

Ximing Lu

7 shared papers

Valentina Pyatkin

research scientist

6 shared papers

Bill Yuchen Lin

researcher

5 shared papers

Faeze Brahman

researcher

5 shared papers

Allyson Ettinger

Khyathi Chandu

Nathan Lambert

researcher

Seungju Han