Cite
Notes
Only stored in your browser.
Attribution
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification
arXiv 2026
from 1 papers
Jianwei Yin
Jintao Chen
Linbo Xi
Wangjie Gan
Wenqi Zhang
Xuhong Zhang