Ziqi Huang
- Papers
- 20
Cite
Notes
Only stored in your browser.
Authored papers
20Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation
arXiv 2026
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond
arXiv 2026
BabyVision: Visual Reasoning Beyond Language
arXiv 2026
Demystifying Video Reasoning
arXiv 2026
VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
arXiv 2025
CineScale: Free Lunch in High-Resolution Cinematic Visual Generation
arXiv 2025
Cut2Next: Generating Next Shot via In-Context Tuning
arXiv 2025
The Quest for Generalizable Motion Generation: Data, Model, and Evaluation
arXiv 2025
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models
arXiv 2025
Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark
arXiv 2025
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
arXiv 2025
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
arXiv 2025
Simulating the Visual World with Artificial Intelligence: A Roadmap
arXiv 2025
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
arXiv 2024
FreeU: Free Lunch in Diffusion U-Net
CVPR 2024 1
FreeInit: Bridging Initialization Gap in Video Diffusion Models
arXiv 2023
ReVersion: Diffusion-Based Relation Inversion from Images
arXiv 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
arXiv 2023
Collaborative Diffusion for Multi-Modal Face Generation and Editing
CVPR 2023 1
Talk-to-Edit: Fine-Grained Facial Editing via Dialog
ICCV 2021 10
Affiliations
Frequent co-authors
10from 20 papers