Sophie Xhonneux

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

Efficient Adversarial Training in LLMs with Continuous Attacks

arXiv 2024

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

arXiv 2024

No known affiliations.

from 2 papers

Aaron Courville

Alessandro Sordoni

Arian Hosseini

Gauthier Gidel

Leo Schwinn

Michael Noukhovitch

Rishabh Agarwal

Shengyi "Costa" Huang

researcher

Stephan Günnemann