Junqi Gao
4 papers
Authored papers
Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning
arXiv 2025
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
arXiv 2025
A Survey of Reinforcement Learning for Large Reasoning Models
arXiv 2025
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers