Cite
Notes
Only stored in your browser.
Attribution
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
arXiv 2023
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
CVPR 2024 1
from 2 papers
Li Yuan
Bin Lin
Bin Zhu
Hongfa Wang
Jiaxi Cui
Junwu Zhang
Munan Ning
Peng Jin
Ryuichi Takanobu
Wei Liu