Cite
Notes
Only stored in your browser.
Attribution
Efficient Multimodal Large Language Models: A Survey
arXiv 2024
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description
A Survey on Benchmarks of Multimodal Large Language Models
Generalized Category Discovery in Semantic Segmentation
arXiv 2023
from 4 papers
Chengjie Wang
Jian Li
Lizhuang Ma
Xin Tan
Yabiao Wang
Zhenye Gan
Bo Zhao
Chaoyou Fu
Ding Qi
Hao Fei