Hang Song
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future
arXiv 2025
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
arXiv 2025
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
arXiv 2024
GroundingGPT:Language Enhanced Multi-modal Grounding Model
arXiv 2024
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers