Cite
Notes
Only stored in your browser.
Attribution
GroundingGPT:Language Enhanced Multi-modal Grounding Model
arXiv 2024
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models
from 3 papers
Hang Song
Zhaowei Li
Botian Jiang
Dong Zhang
Pengyu Wang
Qi Xu
Tao Wang
Wei Wang
Zhida Huang
Junting Pan