Daiki Shimada
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
arXiv 2025
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
arXiv 2025
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 3 papers