Xing Yu
7 papers
Authored papers
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
arXiv 2026
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
arXiv 2026
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
arXiv 2026
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning
arXiv 2026
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning
arXiv 2025
DeepEyesV2: Toward Agentic Multimodal Model
arXiv 2025
Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers