Zhou Ziheng

Cite

Notes

Only stored in your browser.

Attribution

1papers

Authored papers

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

arXiv 2025

No known affiliations.

from 1 papers

Lu Qi

Ming-Hsuan Yang

Wenbo Zhu

Xinting Hu

Xinyu Ye

Xu Yang

Yingzhe Peng

Yizhou Zhou

Yongliang Wu