Yuexian Zou
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13DermoGPT: Open Weights and Open Data for Morphology-Grounded Dermatological Reasoning MLLMs
arXiv 2026
HeartMuLa: A Family of Open Sourced Music Foundation Models
arXiv 2026
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
arXiv 2025
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
arXiv 2025
VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification
arXiv 2025
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
arXiv 2025
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
arXiv 2025
Retrieval is Accurate Generation
arXiv 2024
BrushEdit: All-In-One Image Inpainting and Editing
arXiv 2024
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
arXiv 2023
MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
arXiv 2023
UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework
arXiv 2023
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers