Cite
Notes
Only stored in your browser.
Attribution
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
arXiv 2026
Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads
arXiv 2024
Music Grounding by Short Video
from 3 papers
Peng Jiang
Quan Chen
Jiachun Jin
Siqi Kou
Zhijie Deng
Chang Liu
Jian Jia
Jingyu Liu
Jun Zhu
Kai Yu