Cite
Notes
Only stored in your browser.
Attribution
PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
arXiv 2024
from 1 papers
Jiawei Li
Junlong Zhang
Xinyue Liang
Yang Gao
Yizhe Yang