0

Chenyang Si

Papers
21

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
21papers

Authored papers

21

NGM: A Plug-and-Play Training-Free Memory Module for LLMs

arXiv 2026

2026

TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents

arXiv 2026

2026

NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing

arXiv 2026

2026

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

arXiv 2025

2025

V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

arXiv 2025

2025

Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

ICCV 2025

2025

CoS: Chain-of-Shot Prompting for Long Video Understanding

arXiv 2025

2025

PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

arXiv 2025

2025

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

ICCV 2025

2025

ProGuard: Towards Proactive Multimodal Safeguard

arXiv 2025

2025

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

arXiv 2025

2025

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

ICCV 2025

2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

arXiv 2025

2025

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

arXiv 2024

2024

Scaling Supervised Local Learning with Augmented Auxiliary Networks

arXiv 2024

2024

Momentum Auxiliary Network for Supervised Local Learning

arXiv 2024

2024

FreeU: Free Lunch in Diffusion U-Net

CVPR 2024 1

2023

FreeInit: Bridging Initialization Gap in Video Diffusion Models

arXiv 2023

2023

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

arXiv 2023

2023

Inception Transformer

arXiv 2022

2022

Mugs: A Multi-Granular Self-Supervised Learning Framework

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 21 papers