Yuxin Wen
- Papers: 9
Authored papers
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
arXiv 2024
Coercing LLMs to do and reveal (almost) anything
arXiv 2024
WAVES: Benchmarking the Robustness of Image Watermarks
arXiv 2024
NEFTune: Noisy Embeddings Improve Instruction Finetuning
arXiv 2023
On the Reliability of Watermarks for Large Language Models
arXiv 2023
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery
arXiv 2023
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust
arXiv 2023
Bring Your Own Data! Self-Supervised Evaluation for Large Language Models
arXiv 2023
Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial Queries
arXiv 2022
Affiliations
Frequent co-authors
10 from 9 papers