Attribution
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
arXiv 2024
Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers
from 2 papers
Alessandro Suglia
Arash Eshghi
Oliver Lemon
Malvina Nikandrou