Chenxiao Zhao
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
arXiv 2026
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
arXiv 2026
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
arXiv 2026
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning
arXiv 2026
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning
arXiv 2025
DeepEyesV2: Toward Agentic Multimodal Model
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers