Cite
Notes
Only stored in your browser.
Attribution
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
arXiv 2025
ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
CVPR 2025 1
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
arXiv 2024
from 3 papers
Jie Cheng
Ruixi Qiao
Yisheng Lv
Binhua Li
Chao Guo
Fei-Yue Wang
Gaopeng Gou
Jiamin Zhuang
Jing Yu
Junle Wang