Dian Zheng
- Papers: 13
Authored papers (13)
Gen-Searcher: Reinforcing Agentic Search for Image Generation
arXiv 2026
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
arXiv 2026
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising
arXiv 2026
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
arXiv 2026
OneThinker: All-in-one Reasoning Model for Image and Video
arXiv 2025
EditThinker: Unlocking Iterative Reasoning for Any Image Editor
arXiv 2025
OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation
arXiv 2025
Architecture Decoupling Is Not All You Need For Unified Multimodal Model
arXiv 2025
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models
arXiv 2025
Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark
arXiv 2025
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
arXiv 2025
ProEdit: Inversion-based Editing From Prompts Done Right
arXiv 2025
Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training
ICCV 2023 1
Frequent co-authors: 10, from 13 papers