Chia-Wen Kuo

3 papers

Authored papers

Vidi: Large Multimodal Models for Video Understanding and Editing

arXiv 2025

Where do Large Vision-Language Models Look at when Answering Questions?

arXiv 2025

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

arXiv 2024

No known affiliations.

Co-authors from 3 papers

Fan Chen

Longyin Wen

Sijie Zhu

Celong Liu

Dawei Du

Guang Chen

Humphrey Shi

Jiachen Li

Jiamin Yuan

Jitesh Jain