Cite
Notes
Only stored in your browser.
Attribution
Streaming Dense Video Captioning
CVPR 2024 1
Mixture of Nested Experts: Adaptive Processing of Visual Tokens
arXiv 2024
Simple Open-Vocabulary Object Detection with Vision Transformers
arXiv 2022
Attention Bottlenecks for Multimodal Fusion
NeurIPS 2021 12
from 4 papers
Arsha Nagrani
Cordelia Schmid
Shyamal Buch
Aditya Kusupati
Alexey Dosovitskiy
Alexey Gritsenko
Aravindh Mahendran
Aren Jansen
Austin Myers
Austin Stone