Felix Friedrich
Papers: 12
Authored papers
MSTS: A Multimodal Safety Test Suite for Vision-Language Models
arXiv 2025
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming
arXiv 2024
LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models
arXiv 2024
SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs
arXiv 2024
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You
arXiv 2024
Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness
arXiv 2023
Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations
arXiv 2023
Revision Transformers: Instructing Language Models to Change their Values
arXiv 2022
Does CLIP Know My Face?
arXiv 2022
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
arXiv 2022
A Typology for Exploring the Mitigation of Shortcut Behavior
arXiv 2022
Interactively Providing Explanations for Transformer Language Models
Frequent co-authors
10 from 12 papers
Patrick Schramowski
Kristian Kersting
Manuel Brack
Dominik Hintersdorf
Lukas Struppek
Wolfgang Stammer
Alexander Fraser
Alicia Parrish
Anastassia Shaitarova
Andrea Zugarini