Huanjin Yao
7 papers
Authored papers
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation
arXiv 2026
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO
arXiv 2025
R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
arXiv 2025
MAPO: Mixed Advantage Policy Optimization
arXiv 2025
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
arXiv 2025
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
arXiv 2024
Dense Connector for MLLMs
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers