Weixian Lei
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching
arXiv 2026
Let ViT Speak: Generative Language-Image Pre-training
arXiv 2026
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
arXiv 2025
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
ICCV 2025
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
CVPR 2025 1
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
arXiv 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
ICCV 2023 1
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers