Penghao Wu
- Papers
- 9
Cite
Notes
Only stored in your browser.
9papers
Authored papers
9SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
arXiv 2026
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
arXiv 2025
Streamline Without Sacrifice -- Squeeze out Computation Redundancy in LMM
arXiv 2025
Visual Jigsaw Post-Training Improves MLLMs
arXiv 2025
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior
arXiv 2025
GenAD: Generalized Predictive Model for Autonomous Driving
CVPR 2024 1
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
arXiv 2024
End-to-end Autonomous Driving: Challenges and Frontiers
arXiv 2023
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 9 papers