Attribution
Android in the Zoo: Chain-of-Action-Thought for GUI Agents
arXiv 2024
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
from 2 papers
Zhongyu Wei
Duyu Tang
Jihao Wu
Minghui Liao
Minghui Qiu
Nuo Xu
Ruipu Luo
Xiao Xiao
Yihua Teng
Zejun Li