Attribution
Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
arXiv 2024
Multi-label Cluster Discrimination for Visual Representation Learning
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
from 3 papers
Xiang An
Ziyong Feng
Jiankang Deng
Kaicheng Yang
Ninghua Yang
Ismail Elezi
Qian Zhang
Roy Miles
Tiancheng Gu
Weimo Deng