Attribution
MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment
arXiv 2024
Enhancing Vision-Language Model with Unmasked Token Alignment
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
ICCV 2023
from 3 papers
Hongsheng Li
Jihao Liu
Yu Liu
Jinliang Zheng
Jia Wang
Osamu Yoshie
Qihang Zhang
Tai Wang
Xin Huang