Jin Li
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model
arXiv 2026
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding
arXiv 2024
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 3 papers