Attribution
Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
arXiv 2025
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
arXiv 2024
from 2 papers
Dawei Yin
Lei Sha
Lingyong Yan
Haibo Shi
Shuaiqiang Wang