Guohai Xu
- Papers: 11
Authored papers (11)
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
arXiv 2026
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning
arXiv 2025
DeepEyesV2: Toward Agentic Multimodal Model
arXiv 2025
Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
arXiv 2025
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation
arXiv 2023
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
arXiv 2023
CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility
arXiv 2023
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks
arXiv 2023
Evaluation and Analysis of Hallucination in Large Vision-Language Models
arXiv 2023
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
arXiv 2023
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections
arXiv 2022
Frequent co-authors
10 from 11 papers