Yueqian Wang
Authored papers (7)
Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge
arXiv 2024
HawkEye: Training Video-Text LLMs for Grounding Text in Videos
arXiv 2024
VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format
arXiv 2024
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
arXiv 2024
Understanding Multimodal Hallucination with Parameter-Free Representation Alignment
arXiv 2024
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering
arXiv 2024
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions
arXiv 2023
Frequent co-authors
10 from 7 papers