Cite
Notes
Only stored in your browser.
Attribution
WPO: Enhancing RLHF with Weighted Preference Optimization
arXiv 2024
from 1 papers
Chenguang Zhu
Kaiqiang Song
Ravi Agrawal
Sathish Reddy Indurthi
Shujian Zhang
Silei Xu
Wenxuan Zhou