Licheng Yu
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14ROICtrl: Boosting Instance Control for Visual Generation
CVPR 2025 1
Movie Gen: A Cast of Media Foundation Models
arXiv 2024
AVID: Any-Length Video Inpainting with Diffusion Model
CVPR 2024 1
CiT: Curation in Training for Effective Vision-Language Data
ICCV 2023 1
Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
CVPR 2023 1
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
CVPR 2023 1
AMELI: Enhancing Multimodal Entity Linking with Fine-Grained Attributes
arXiv 2023
VIOLIN: A Large-Scale Dataset for Video-and-Language Inference
violin-a-large-scale-dataset-for-video-and-1
What is More Likely to Happen Next? Video-and-Language Future Event Prediction
EMNLP 2020 11
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
EMNLP 2020 11
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
ECCV 2020 8
TVQA+: Spatio-Temporal Grounding for Video Question Answering
tvqa-spatio-temporal-grounding-for-video-1
UNITER: UNiversal Image-TExt Representation Learning
ECCV 2020 8
Modeling Context in Referring Expressions
arXiv 2016
Affiliations
Frequent co-authors
10from 14 papers