Attribution
Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned
arXiv 2025
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
arXiv 2024
from 2 papers
Soujanya Poria
Tej Deep Pala
Brandon Ong
Deepanway Ghosal
Pengfei Hong
Qi Sun
U-Xuan Tan
William Chandra Tjhi