Cite
Notes
Only stored in your browser.
Attribution
HawkEye: Training Video-Text LLMs for Grounding Text in Videos
arXiv 2024
VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
from 3 papers
Dongyan Zhao
Jianxin Liang
Yueqian Wang
Yuxuan Wang
Qun Liu
Huishuai Zhang
Jiansheng Wei