Jindong Gu
- Papers
- 22
Cite
Notes
Only stored in your browser.
Authored papers
22REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites
arXiv 2025
Safety at Scale: A Comprehensive Survey of Large Model Safety
arXiv 2025
Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs
arXiv 2025
Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models
arXiv 2025
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
arXiv 2025
Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
arXiv 2024
Can Large Language Model Agents Simulate Human Trust Behavior?
arXiv 2024
Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
arXiv 2024
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
CVPR 2025 1
Can Editing LLMs Inject Harm?
arXiv 2024
Stop Reasoning! When Multimodal LLM with Chain-of-Thought Reasoning Meets Adversarial Image
arXiv 2024
Multimodal Pragmatic Jailbreak on Text-to-image Models
arXiv 2024
Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution
arXiv 2024
Dataset Distillation by Automatic Training Trajectories
arXiv 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
arXiv 2024
Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
arXiv 2024
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models
arXiv 2023
Influencer Backdoor Attack on Semantic Segmentation
arXiv 2023
XAI for In-hospital Mortality Prediction via Multimodal ICU Data
arXiv 2023
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models
arXiv 2023
Multi-event Video-Text Retrieval
ICCV 2023 1
FRAug: Tackling Federated Learning with Non-IID Features via Representation Augmentation
ICCV 2023 1
Affiliations
Frequent co-authors
10from 22 papers