Attribution
Fine-Tuning Language Models from Human Preferences
arXiv 2019
Adversarial Patch
arXiv 2017
from 2 papers
Alec Radford
researcher
Aurko Roy
Dandelion Mané
Daniel M. Ziegler
Dario Amodei
CEO
Geoffrey Irving
Jeffrey Wu
Justin Gilmer
Martín Abadi
Nisan Stiennon