Cite
Notes
Only stored in your browser.
Attribution
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue
arXiv 2026
The Trojan Knowledge: Bypassing Commercial LLM Guardrails via Harmless Prompt Weaving and Adaptive Tree Search
arXiv 2025
from 2 papers
Eli Chien
Pan Li
Peizhi Niu
Pin-Yu Chen
Ruihan Wu
Xinjie Shen
Bo Li
Haoyu Wang
Olgica Milenkovic
Tony Tu