Attribution
Contrastive Preference Learning: Learning from Human Feedback without RL
arXiv 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
from 2 papers
Scott Niekum
Amy Zhang
Chelsea Finn
Dorsa Sadigh
Joey Hejna
Qinqing Zheng
Rafael Rafailov
W. Bradley Knox