Cite
Notes
Only stored in your browser.
Attribution
Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders
arXiv 2026
Going Beyond Nouns With Vision & Language Models Using Synthetic Data
ICCV 2023 1
CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning
CVPR 2023 1
from 3 papers
Donghyun Kim
James Seale Smith
Leonid Karlinsky
Rameswar Panda
Rogerio Feris
Assaf Arbelle
Aude Oliva
Gül Varol
Khaled Shehada
Shang-Jui Ray Kuo