Haoran Sun
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10A Very Big Video Reasoning Suite
arXiv 2026
Missing Old Logits in Asynchronous Agentic RL: Semantic Mismatch and Repair Methods for Off-Policy Correction
arXiv 2026
Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning
arXiv 2026
Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs
arXiv 2025
SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence
arXiv 2025
Curiosity-Driven Reinforcement Learning from Human Feedback
arXiv 2025
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
arXiv 2024
Multilingual Large Language Models: A Systematic Survey
arXiv 2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
arXiv 2024
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
arXiv 2024
Affiliations
Frequent co-authors
10from 10 papers