Cite
Notes
Only stored in your browser.
Attribution
Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training
arXiv 2026
R-PRM: Reasoning-Driven Process Reward Modeling
arXiv 2025
PATS: Process-Level Adaptive Thinking Mode Switching
Process-based Self-Rewarding Language Models
from 4 papers
ShuJian Huang
Jiajun Chen
Shimao Zhang
Xin Huang
Junlan Feng
Liqian Huang
Shuaijie She
Xiao Liu
Xin Zhang
Xue Han