Jiabo Ye

Papers: 12

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

12papers

Authored papers

Mobile-Agent-v3: Fundamental Agents for GUI Automation

arXiv 2025

2025

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

arXiv 2025

2025

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

CVPR 2025 1

2025

Qwen2.5-VL Technical Report

arXiv 2025

2025

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation

arXiv 2025

2025

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

arXiv 2024

2024

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

arXiv 2024

2024

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

CVPR 2024 1

2023

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks

arXiv 2023

2023

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video

arXiv 2023

2023

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

arXiv 2023

2023

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Fei Huang

Haiyang Xu

Ming Yan

Ji Zhang

Jingren Zhou

Anwen Hu

Qi Qian

Chenliang Li

Guohai Xu

Haowei Liu