Shihao Wang
- Papers: 9
Authored papers (9)
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline
arXiv 2026
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
arXiv 2025
InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction
arXiv 2025
Slow-Fast Architecture for Video Multi-Modal Large Language Models
arXiv 2025
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
arXiv 2024
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
arXiv 2024
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
ICCV 2023
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation
arXiv 2023
Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans
arXiv 2021
Frequent co-authors
10 from 9 papers