Hao Shao
- Papers: 8
Authored papers (8)
DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving
arXiv 2026
DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning
arXiv 2026
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
CVPR 2025 1
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
arXiv 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
arXiv 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
arXiv 2024
LMDrive: Closed-Loop End-to-End Driving with Large Language Models
CVPR 2024 1
MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention
arXiv 2023
Frequent co-authors
10 from 8 papers