Cite
Notes
Only stored in your browser.
Attribution
RealHarm: A Collection of Real-World Language Model Application Failures
arXiv 2025
Phare: A Safety Probe for Large Language Models
from 2 papers
Matteo Dora
Benoît Malézieux
Jiaen Liu
Luca Rossi
Weixuan Xiao