Zhi Hou
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv 2026
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
arXiv 2025
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
arXiv 2025
Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
arXiv 2025
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
ICCV 2025
Diffusion Transformer Policy
arXiv 2024
BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning
CVPR 2022 1
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers