Yanhong Zeng
- Papers: 18
Authored papers (18)
Advancing Open-source World Models
arXiv 2026
WORLDMEM: Long-term Consistent World Simulation with Memory
arXiv 2025
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models
arXiv 2025
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
arXiv 2025
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text
arXiv 2025
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
arXiv 2025
CharacterShot: Controllable and Consistent 4D Character Animation
arXiv 2025
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
arXiv 2025
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues
arXiv 2025
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
arXiv 2025
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
arXiv 2024
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
CVPR 2025 1
StyleShot: A Snapshot on Any Style
arXiv 2024
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
arXiv 2024
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
CVPR 2024 1
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting
arXiv 2023
Aggregated Contextual Transformations for High-Resolution Image Inpainting
arXiv 2021
Learning Joint Spatial-Temporal Transformations for Video Inpainting
ECCV 2020 8
Affiliations
Frequent co-authors
10 from 18 papers