Cite
Notes
Only stored in your browser.
Attribution
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
arXiv 2024
Self-Play Preference Optimization for Language Model Alignment
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
arXiv 2021
from 3 papers
Huizhuo Yuan
Quanquan Gu
Jie Tang
engineer
Weng Lam Tam
Xiao Liu
Yicheng Fu
Yihe Deng
Yiming Yang
professor
Yue Wu
Zhengxiao Du