Xu Zhang
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation
arXiv 2026
MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
arXiv 2025
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents
arXiv 2025
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference
arXiv 2025
Enabling Versatile Controls for Video Diffusion Models
arXiv 2025
A General Knowledge Injection Framework for ICD Coding
a-general-knowledge-injection-framework-for
UFO: A UI-Focused Agent for Windows OS Interaction
arXiv 2024
Content-Based Collaborative Generation for Recommender Systems
arXiv 2024
RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models
arXiv 2024
Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation
arXiv 2024
TaskWeaver: A Code-First Agent Framework
arXiv 2023
CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose
CVPR 2023 1
Selective Fairness in Recommendation via Prompts
arXiv 2022
Affiliations
Frequent co-authors
10from 13 papers