Query-Policy Misalignment in Preference-Based Reinforcement Learning
arXiv 2023
Jianxiong Li
Xianyuan Zhan
Xiao Hu
Ya-Qin Zhang