Cite
Notes
Only stored in your browser.
Attribution
Learning Optimal Advantage from Preferences and Mistaking it for Reward
arXiv 2023
from 1 papers
Anca Dragan
Peter Stone
Scott Niekum
Sigurdur Orn Adalgeirsson
Stephane Hatgis-Kessell
W. Bradley Knox