Xin Han
5 papers
Authored papers
Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation
arXiv 2026
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
arXiv 2025
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv 2025
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model
arXiv 2025
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers