Cite
Notes
Only stored in your browser.
Attribution
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs
arXiv 2025
Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs
arXiv 2023
from 2 papers
Filip Ilievski
Jiarui Zhang
Mahyar Khayatkhoei