Chaowei Xiao
- Papers
- 29
Cite
Notes
Only stored in your browser.
Authored papers
29DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents
arXiv 2026
AgentSys: Secure and Dynamic LLM Agents Through Explicit Hierarchical Memory Management
arXiv 2026
FORTIS: Benchmarking Over-Privilege in Agent Skills
arXiv 2026
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
Safety at Scale: A Comprehensive Survey of Large Model Safety
arXiv 2025
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
arXiv 2025
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
arXiv 2024
TrustLLM: Trustworthiness in Large Language Models
arXiv 2024
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases
arXiv 2024
LeanAgent: Lifelong Learning for Formal Theorem Proving
arXiv 2024
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
arXiv 2024
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching
arXiv 2024
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
arXiv 2024
Instructional Fingerprinting of Large Language Models
arXiv 2024
EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
arXiv 2024
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
arXiv 2024
A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents
arXiv 2024
Can Editing LLMs Inject Harm?
arXiv 2024
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection
arXiv 2024
Voyager: An Open-Ended Embodied Agent with Large Language Models
arXiv 2023
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
CVPR 2023 1
Prismer: A Vision-Language Model with Multi-Task Experts
arXiv 2023
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models
arXiv 2023
A Text-guided Protein Design Framework
arXiv 2023
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback
arXiv 2023
On the Exploitability of Instruction Tuning
on-the-exploitability-of-instruction-tuning
Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing
arXiv 2022
Diffusion Models for Adversarial Purification
arXiv 2022
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
arXiv 2022
Affiliations
Frequent co-authors
10from 29 papers