Han Xiao
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research
arXiv 2026
MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments
arXiv 2026
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
arXiv 2025
WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch
arXiv 2025
MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning
arXiv 2025
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
arXiv 2025
UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning
arXiv 2025
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
arXiv 2025
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
CVPR 2025 1
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT
arXiv 2024
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
arXiv 2024
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models
arXiv 2024
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
arXiv 2024
ImageBind-LLM: Multi-modality Instruction Tuning
arXiv 2023
Token-Label Alignment for Vision Transformers
ICCV 2023 1
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
arXiv 2017
Affiliations
Frequent co-authors
10from 16 papers