Cite
Notes
Only stored in your browser.
Attribution
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
CVPR 2025 1
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
arXiv 2024
from 2 papers
Bernard Ghanem
Anjie Yang
Bochen Qian
Chen Zhao
Dai-Jie Wu
Guohao Li
Jianbo Deng
Linyao Chen
Philip Torr
Shilong Liu