Heting Gao
4 papers
Authored papers
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
arXiv 2026
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
arXiv 2025
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
arXiv 2025
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers