Jindong Gu

Safety at Scale: A Comprehensive Survey of Large Model Safety

arXiv 2025

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

arXiv 2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

arXiv 2025

Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs

arXiv 2025

Improved Techniques for Optimization-Based Jailbreaking on Large Language Models

arXiv 2024

When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

arXiv 2024

Dataset Distillation by Automatic Training Trajectories

arXiv 2024

Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?

arXiv 2024

Can Large Language Model Agents Simulate Human Trust Behavior?

arXiv 2024

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images

arXiv 2024

ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos

CVPR 2025

Can Editing LLMs Inject Harm?

arXiv 2024

Stop Reasoning! When Multimodal LLM with Chain-of-Thought Reasoning Meets Adversarial Image

arXiv 2024

Multimodal Pragmatic Jailbreak on Text-to-image Models

arXiv 2024

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

arXiv 2024

MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models

arXiv 2023

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

arXiv 2023

XAI for In-hospital Mortality Prediction via Multimodal ICU Data

arXiv 2023

Multi-event Video-Text Retrieval

ICCV 2023

Influencer Backdoor Attack on Semantic Segmentation

arXiv 2023