Attribution
UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards
arXiv 2026
Region-based Cluster Discrimination for Visual Representation Learning
ICCV 2025
Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
arXiv 2024
from 3 papers
Kaicheng Yang
Ziyong Feng
Ismail Elezi
Jiankang Deng
Roy Miles
Tiancheng Gu
Weimo Deng
Xiang An
Yin Xie
Yumeng Wang