Zixin Zhang
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
arXiv 2026
DVD: Deterministic Video Depth Estimation with Generative Priors
arXiv 2026
Panoramic Affordance Prediction
arXiv 2026
Show, Don't Tell: Morphing Latent Reasoning into Image Generation
arXiv 2026
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
arXiv 2025
Step-DeepResearch Technical Report
arXiv 2025
A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning
arXiv 2025
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
arXiv 2025
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
arXiv 2025
Step-Audio 2 Technical Report
arXiv 2025
PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs
arXiv 2025
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
arXiv 2025
PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs
ICCV 2023 1
Affiliations
Frequent co-authors
10from 13 papers