Attribution
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
arXiv 2025
Junshu Pan
Qiji Zhou
Wei Shen
Yue Zhang