Attribution
Weak-to-Strong Jailbreaking on Large Language Models
arXiv 2024
Protecting Language Generation Models via Invisible Watermarking
arXiv 2023
Provable Robust Watermarking for AI-Generated Text
Differentially Private Optimization on Large Model at Small Cost
arXiv 2022
from 4 papers
Lei Li
Xuandong Zhao
Chao Du
George Karypis
Prabhanjan Ananth
Sheng Zha
Tianyu Pang
William Yang Wang
Xianjun Yang
Zhiqi Bu