Attribution
SketchVLM: Vision language models can annotate images to explain thoughts and guide users
arXiv 2026
Vision language models are blind: Failing to translate detailed visual features into words
arXiv 2024
from 2 papers
Anh Totti Nguyen
Mohammad Reza Taesiri
Brandon Collins
Hung Huy Nguyen
Pooyan Rahmanzadehgervi
Trung Bui