Gauthier Gidel
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Efficient Adversarial Training in LLMs with Continuous Attacks
arXiv 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
arXiv 2024
Expected flow networks in stochastic environments and two-player zero-sum games
arXiv 2023
On the Stability of Iterative Retraining of Generative Models on their own Data
arXiv 2023
Synaptic Weight Distributions Depend on the Geometry of Plasticity
arXiv 2023
Convergence of Proximal Point and Extragradient-Based Methods Beyond Monotonicity: the Case of Negative Comonotonicity
arXiv 2022
Online Adversarial Attacks
online-adversarial-attacks-1
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers