Norman Mu

Papers: 7

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

arXiv 2024

2024

Can LLMs Follow Simple Rules?

arXiv 2023

2023

Mark My Words: Analyzing and Evaluating Language Model Watermarks

arXiv 2023

2023

SLIP: Self-supervision meets Language-Image Pre-training

arXiv 2021

2021

The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization

ICCV 2021

2020

MNIST-C: A Robustness Benchmark for Computer Vision

arXiv 2019

2019

AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

ICLR 2020

2019

Affiliations

No known affiliations.

Frequent co-authors

from 7 papers

Dan Hendrycks

director

4 shared papers

David Wagner

3 shared papers

Justin Gilmer

3 shared papers

Steven Basart

researcher

2 shared papers

Zifan Wang

2 shared papers

Alexander Kirillov

1 shared paper

Andy Zou

founder

1 shared paper

Balaji Lakshminarayanan

1 shared paper

Barret Zoph

founder

1 shared paper

Basel Alomair

1 shared paper