Yihe Deng
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks
arXiv 2026
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
arXiv 2025
Entropy-Based Adaptive Weighting for Self-Training
arXiv 2025
OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement
arXiv 2025
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
arXiv 2024
Enhancing Large Vision Language Models with Self-Training on Image Comprehension
arXiv 2024
Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance
arXiv 2024
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
arXiv 2023
Towards Understanding Mixture of Experts in Deep Learning
arXiv 2022
Affiliations
Frequent co-authors
10from 9 papers