Cite
Notes
Only stored in your browser.
Attribution
WPO: Enhancing RLHF with Weighted Preference Optimization
arXiv 2024
from 1 papers
Chenguang Zhu
Kaiqiang Song
Sanqiang Zhao
Sathish Reddy Indurthi
Shujian Zhang
Silei Xu
Wenxuan Zhou