0

Chaowei Xiao

Papers
29

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
29papers

Authored papers

29

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

arXiv 2026

2026

AgentSys: Secure and Dynamic LLM Agents Through Explicit Hierarchical Memory Management

arXiv 2026

2026

FORTIS: Benchmarking Over-Privilege in Agent Skills

arXiv 2026

2026

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

arXiv 2025

2025

Safety at Scale: A Comprehensive Survey of Large Model Safety

arXiv 2025

2025

JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model

arXiv 2025

2025

AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

arXiv 2024

2024

TrustLLM: Trustworthiness in Large Language Models

arXiv 2024

2024

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

arXiv 2024

2024

LeanAgent: Lifelong Learning for Formal Theorem Proving

arXiv 2024

2024

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

arXiv 2024

2024

T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching

arXiv 2024

2024

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

arXiv 2024

2024

Instructional Fingerprinting of Large Language Models

arXiv 2024

2024

EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage

arXiv 2024

2024

Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset

arXiv 2024

2024

A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents

arXiv 2024

2024

Can Editing LLMs Inject Harm?

arXiv 2024

2024

HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection

arXiv 2024

2024

Voyager: An Open-Ended Embodied Agent with Large Language Models

arXiv 2023

2023

VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion

CVPR 2023 1

2023

Prismer: A Vision-Language Model with Multi-Task Experts

arXiv 2023

2023

AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models

arXiv 2023

2023

A Text-guided Protein Design Framework

arXiv 2023

2023

ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback

arXiv 2023

2023

On the Exploitability of Instruction Tuning

on-the-exploitability-of-instruction-tuning

2023

Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing

arXiv 2022

2022

Diffusion Models for Adversarial Purification

arXiv 2022

2022

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 29 papers