Cite
Notes
Only stored in your browser.
Attribution
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
arXiv 2025
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
arXiv 2024
from 2 papers
Afshin Dehghan
Kai Kang
David Griffiths
Erik Daxberger
Gefen Kohavi
Hong-You Chen
Justin Lazarow
Marcin Eichner
Mingfei Gao
Mingze Xu