Aosong Cheng
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
arXiv 2024
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
arXiv 2024
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
ICCV 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 3 papers