Hangyu Mao
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Agentic Reinforced Policy Optimization
arXiv 2025
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
arXiv 2025
Agentic Entropy-Balanced Policy Optimization
arXiv 2025
GARDO: Reinforcing Diffusion Models without Reward Hacking
arXiv 2025
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers