Cite
Notes
Only stored in your browser.
Attribution
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing
arXiv 2025
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
from 3 papers
Salman Khan
Fahad Shahbaz Khan
Ahmed Heakl
Akashah Shabbir
Dinura Dissanayake
Fahad S. Khan
Hanoona Rasheed
Hisham Cholakkal
Ivan Laptev
Jean Lahoud