Chenyang Si
- Papers
- 21
Cite
Notes
Only stored in your browser.
Authored papers
21NGM: A Plug-and-Play Training-Free Memory Module for LLMs
arXiv 2026
TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents
arXiv 2026
NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing
arXiv 2026
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
arXiv 2025
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning
arXiv 2025
Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
ICCV 2025
CoS: Chain-of-Shot Prompting for Long Video Understanding
arXiv 2025
PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design
arXiv 2025
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
ICCV 2025
ProGuard: Towards Proactive Multimodal Safeguard
arXiv 2025
LongVie 2: Multimodal Controllable Ultra-Long Video World Model
arXiv 2025
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
ICCV 2025
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
arXiv 2025
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
arXiv 2024
Scaling Supervised Local Learning with Augmented Auxiliary Networks
arXiv 2024
Momentum Auxiliary Network for Supervised Local Learning
arXiv 2024
FreeU: Free Lunch in Diffusion U-Net
CVPR 2024 1
FreeInit: Bridging Initialization Gap in Video Diffusion Models
arXiv 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
arXiv 2023
Inception Transformer
arXiv 2022
Mugs: A Multi-Granular Self-Supervised Learning Framework
arXiv 2022
Affiliations
Frequent co-authors
10from 21 papers