Attribution
Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction
arXiv 2025
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs
arXiv 2023
from 2 papers
Fei Wu
Kun Kuang
Changlong Sun
Chao Wu
Fubang Zhao
Lizhi Qing
Yangyang Kang
Yifeng Feng
YuTing Huang