Cite
Notes
Only stored in your browser.
Attribution
Fine-Tuning Language Models with Advantage-Induced Policy Alignment
arXiv 2023
from 1 papers
Banghua Zhu
professor
Chenguang Zhu
Felipe Vieira Frujeri
Hiteshi Sharma
Jiantao Jiao
Michael. I. Jordan