Attribution
AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
arXiv 2024
from 1 paper
Alexandra Souly
Andy Zou (founder)
Dan Hendrycks (director)
Derek Duenas
Eric Winsor
Justin Wang
Maksym Andriushchenko
Mateusz Dziemian
Matt Fredrikson
Maxwell Lin