Haoran Chen

Attribution

5 papers

Authored papers

CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation

arXiv 2026

Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices

CVPR 2025

Recurrent Context Compression: Efficiently Expanding the Context Window of LLM

arXiv 2024

ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection

arXiv 2024

A Survey on Video Diffusion Models

arXiv 2023

No known affiliations.

from 5 papers

Yu-Gang Jiang

Zuxuan Wu

Chensen Huang

Dong Yi

Guibo Zhu

Guojing Ge

Han Hu

Hang Xu

Haoran Jiang

Haoyu Zhao