Cite
Notes
Only stored in your browser.
Attribution
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
arXiv 2024
Right this way: Can VLMs Guide Us to See More to Answer Questions?
from 2 papers
Yi Zhang
Ching-Chen Kuo
Diji Yang
Jie Yang
Kalyana Suma Sree Tholeti
Leilani H. Gilpin
Li Liu
Shan Jiang
Sijia Zhong
Xin Eric Wang