Cite
Notes
Only stored in your browser.
Attribution
SpatialBot: Precise Spatial Understanding with Vision Language Models
arXiv 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
from 2 papers
Hao Dong
Xiaoqi Li
Bo Zhao
Hongsheng Li
Jianhao Yuan
Peng Gao
Siyuan Huang
Wankou Yang
Wenxiao Cai
Xiaobin Hu