Zeliang Zhang

Papers: 9

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

arXiv 2025

2025

Directional Reasoning Injection for Fine-Tuning MLLMs

arXiv 2025

2025

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

arXiv 2025

2025

Generative AI for Cel-Animation: A Survey

arXiv 2025

2025

CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs

arXiv 2025

2025

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

arXiv 2025

2025

Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See

arXiv 2024

2024

VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?

CVPR 2025

2024

Video Understanding with Large Language Models: A Survey

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Chenliang Xu

Chao Huang

Susan Liang

Jing Bi

Yunlong Tang

Hang Hua

Luchuan Song

Mingqian Feng

Pinxin Liu

Ali Vosoughi