Cite
Notes
Only stored in your browser.
Attribution
Explore the Limits of Omni-modal Pretraining at Scale
arXiv 2024
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
arXiv 2023
from 2 papers
Jing Liu
Jiashi Feng
Sihan Chen
Xiangyu Yue
Xiaojie Jin
Xingjian He
Yiyuan Zhang