Norman Mu
7 papers
Authored papers
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
arXiv 2024
Can LLMs Follow Simple Rules?
arXiv 2023
Mark My Words: Analyzing and Evaluating Language Model Watermarks
arXiv 2023
SLIP: Self-supervision meets Language-Image Pre-training
arXiv 2021
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
ICCV 2021
MNIST-C: A Robustness Benchmark for Computer Vision
arXiv 2019
AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
ICLR 2020
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers
Dan Hendrycks
director
David Wagner
Justin Gilmer
Steven Basart
researcher
Zifan Wang
Alexander Kirillov
Andy Zou
founder
Balaji Lakshminarayanan
Barret Zoph
founder
Basel Alomair