Changyou Chen
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding
arXiv 2025
MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models
arXiv 2025
VisR-Bench: An Empirical Study on Visual Retrieval-Augmented Generation for Multilingual Long Document Understanding
arXiv 2025
TextLap: Customizing Language Models for Text-to-Layout Planning
arXiv 2024
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
arXiv 2024
A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot
arXiv 2023
Long-Term Ad Memorability: Understanding & Generating Memorable Ads
arXiv 2023
Shifted Diffusion for Text-to-image Generation
CVPR 2023 1
MINIMAL: Mining Models for Data Free Universal Adversarial Triggers
arXiv 2021
LAFITE: Towards Language-Free Training for Text-to-Image Generation
arXiv 2021
Affiliations
Frequent co-authors
10from 10 papers