Haoran Chen
5 papers
Authored papers
CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation
arXiv 2026
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices
CVPR 2025 1
Recurrent Context Compression: Efficiently Expanding the Context Window of LLM
arXiv 2024
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
arXiv 2024
A Survey on Video Diffusion Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers