Cite
Notes
Only stored in your browser.
Attribution
DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding
arXiv 2025
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
CVPR 2025 1
from 2 papers
Gedas Bertasius
Thomas Seidl
Dimitrios Mallios
Faegheh Sardari
Jindong Gu
Md Mohaiminul Islam
Mohsen Fayyaz
Parth Pathak
Sunando Sengupta