Michael Backes

Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

11papers

Authored papers

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

arXiv 2025

2025

TrustLLM: Trustworthiness in Large Language Models

arXiv 2024

2024

Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media

arXiv 2024

2024

Memorization in Self-Supervised Learning Improves Downstream Generalization

arXiv 2024

2024

ModSCAN: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities

arXiv 2024

2024

Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models

arXiv 2023

2023

Generated Graph Detection

arXiv 2023

2023

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

arXiv 2023

2023

MGTBench: Benchmarking Machine-Generated Text Detection

arXiv 2023

2023

Prompt Stealing Attacks Against Text-to-Image Generation Models

arXiv 2023

2023

Data Poisoning Attacks Against Multimodal Encoders

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 11 papers

Yang Zhang

8 shared papers

Xinyue Shen

6 shared papers

Xinlei He

5 shared papers

Caiming Xiong

researcher

Chaowei Xiao

Chujie Gao

Furong Huang

Haoran Wang

Heng Ji

professor

2 shared papers

huan zhang

2 shared papers