Ali Vosoughi
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
arXiv 2025
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
arXiv 2025
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
arXiv 2025
Video Understanding with Large Language Models: A Survey
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers