Jiadi Su
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model
arXiv 2026
Thinking with Generated Images
arXiv 2025
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
arXiv 2025
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
arXiv 2025
Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
arXiv 2025
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World
arXiv 2024
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers