Weifeng Lin
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments
arXiv 2026
Rethinking VLM Representation for VLA Initialization
arXiv 2026
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
arXiv 2025
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects
arXiv 2025
MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
arXiv 2025
IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models
arXiv 2025
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
arXiv 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
arXiv 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
arXiv 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
arXiv 2024
Scale-Aware Modulation Meet Transformer
ICCV 2023 1
Affiliations
Frequent co-authors
10from 11 papers