Cite
Notes
Only stored in your browser.
Attribution
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
arXiv 2024
4M: Massively Multimodal Masked Modeling
4m-massively-multimodal-masked-modeling
from 2 papers
Afshin Dehghan
Amir Zamir
David Mizrahi
Haiming Gang
Hong-You Chen
Kai Kang
Mingze Xu
Oğuzhan Fatih Kar
Roman Bachmann
Teresa Yeo