Attribution
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
arXiv 2024
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
from 2 papers
Xiangyu Yue
Xiaohan Ding
Yiyuan Zhang
Sanyuan Zhao
Yuhao Kang