Hanbin Wang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Process Reinforcement through Implicit Rewards
arXiv 2025
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
arXiv 2025
From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones
arXiv 2025
Advancing LLM Reasoning Generalists with Preference Trees
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers
Ganqu Cui
researcher
Hao Peng
Lifan Yuan
grad-student
Maosong Sun
professor
Ning Ding
researcher
Zhiyuan Liu
professor
Bowen Zhou
professor
Weize Chen
Aoyan Li
Baoquan Zhong