Cite
Notes
Only stored in your browser.
Attribution
Complex Logical Instruction Generation
arXiv 2025
WPO: Enhancing RLHF with Weighted Preference Optimization
arXiv 2024
from 2 papers
Kaiqiang Song
Chenguang Zhu
Haoyun Deng
Mian Zhang
Ming Yin
Ravi Agrawal
Sanqiang Zhao
Shujian Liu
Shujian Zhang
Silei Xu