Shenzhi Yang
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Can LLMs Learn to Reason Robustly under Noisy Supervision?
arXiv 2026
AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents
arXiv 2026
TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
arXiv 2025
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
arXiv 2025
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers