Cite
Notes
Only stored in your browser.
Attribution
SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
arXiv 2026
from 1 papers
Guanhua Chen
Long Li
Peng Li
Shaohan Huang
Tianyi Wang
Yang Liu
Yixia Li
Yun Chen