Cite
Notes
Only stored in your browser.
Attribution
InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding
arXiv 2024
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model
from 2 papers
Hongxia Yang
Huaibo Huang
Quanzeng You
Ran He
Xiaotian Han
Yongfei Liu
Bohan Zhai
Yiqi Wang
Yunzhe Tao