Hao Shao

Papers: 8

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

8papers

Authored papers

DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving

arXiv 2026

2026

DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning

arXiv 2026

2026

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

CVPR 2025 1

2025

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

arXiv 2024

2024

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

arXiv 2024

2024

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

arXiv 2024

2024

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

CVPR 2024 1

2023

MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Hongsheng Li

Letian Wang

Steven L. Waslander

Yu Liu

Zhuofan Zong

Guanglu Song

Han Xiao

Peng Gao

Yang Zhou

Aojun Zhou