Cite
Notes
Only stored in your browser.
Attribution
ViPRA: Video Prediction for Robot Actions
arXiv 2025
Pretrained Language Models as Visual Planners for Human Assistance
ICCV 2023 1
Learning State-Aware Visual Representations from Audible Interactions
arXiv 2022
from 3 papers
Abhinav Gupta
Deepak Pathak
Dhruvesh Patel
Hamid Eghbalzadeh
Hengkai Pan
Himangi Mittal
Michael Louis Iuzzolino
Nitin Kamra
Pedro Morgado
Ruta Desai