Cite
Notes
Only stored in your browser.
Attribution
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
CVPR 2025 1
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
CVPR 2024 1
CvT: Introducing Convolutions to Vision Transformers
ICCV 2021 10
from 3 papers
Bin Xiao
Lu Yuan
Xiyang Dai
Ce Liu
Dianqi Li
Houdong Hu
Jianfeng Gao
Jianwei Yang
Jiuhai Chen
Lei Zhang