Cite
Notes
Only stored in your browser.
Attribution
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models
arXiv 2025
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding
from 2 papers
Changyao Tian
Gen Luo
Hao Li
Jifeng Dai
Xizhou Zhu
Xue Yang
Yu Qiao
Zhaokai Wang
Junqi Ge
Lewei Lu