Mengchen Zhang
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
arXiv 2025
SS4D: Native 4D Generative Model via Structured Spacetime Latents
arXiv 2025
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
ICCV 2025
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
arXiv 2024
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
arXiv 2024
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
arXiv 2024
Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers