Attribution
PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
arXiv 2024
Chong Feng
Jiawei Li
Xinyue Liang
Yang Gao
Yizhe Yang