Cite
Notes
Only stored in your browser.
Attribution
Confronting Reward Model Overoptimization with Constrained RLHF
arXiv 2023
from 1 papers
Aaditya K. Singh
DJ Strouse
Ruslan Salakhutdinov
professor
Stephen Mcaleer
Ted Moskovitz
Tuomas Sandholm