Attribution
Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language Models
arXiv 2024
WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
arXiv 2023
from 2 papers
Jifan Yu
Juanzi Li
Lei Hou
Shangqing Tu
Hongning Wang
Wenxuan Wang
Yushi Bai
Zhexin Zhang
Zhuoran Pan