Attribution
Advantage-Guided Distillation for Preference Alignment in Small Language Models
arXiv 2025
from 1 paper
Fanqi Wan
Qifan Wang
Shiping Gao
Xiaojun Quan