Jianhao Yan
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search
arXiv 2026
P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads
arXiv 2026
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
arXiv 2026
Detecting RLVR Training Data via Structural Convergence of Reasoning
arXiv 2026
Learning to Reason under Off-Policy Guidance
arXiv 2025
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
arXiv 2025
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
arXiv 2024
Understanding In-Context Learning from Repetitions
arXiv 2023
Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation
arXiv 2023
Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace
arXiv 2023
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
arXiv 2022
Affiliations
Frequent co-authors
10from 11 papers