Zuwei Long
5 papers
Authored papers
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
arXiv 2026
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
arXiv 2026
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
arXiv 2025
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
arXiv 2025
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers