Han Xiao

Papers: 16

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

16papers

Authored papers

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

arXiv 2026

2026

MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments

arXiv 2026

2026

LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

arXiv 2025

2025

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

arXiv 2025

2025

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

arXiv 2025

2025

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

arXiv 2025

2025

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

CVPR 2025 1

2025

UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning

arXiv 2025

2025

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

arXiv 2025

2025

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT

arXiv 2024

2024

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

arXiv 2024

2024

Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models

arXiv 2024

2024

AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark

arXiv 2024

2024

ImageBind-LLM: Multi-modality Instruction Tuning

arXiv 2023

2023

Token-Label Alignment for Vision Transformers

ICCV 2023 1

2022

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

arXiv 2017

2017

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Hongsheng Li

Shuai Ren

Aojun Zhou

Liang Liu

Yuxiang Chai

Hao Wang

Ke Wang

Peng Gao

Weifeng Lin

Xiaoxin Chen