Xiyang Dai
- Papers: 11
Authored papers
Efficient Modulation for Vision Networks
arXiv 2024
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation
arXiv 2024
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
CVPR 2024 1
GLIPv2: Unifying Localization and Vision-Language Understanding
arXiv 2022
Generalized Decoding for Pixel, Image, and Language
CVPR 2023 1
Focal Modulation Networks
arXiv 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
CVPR 2022 1
Dynamic Head: Unifying Object Detection Heads with Attentions
CVPR 2021 1
Florence: A New Foundation Model for Computer Vision
arXiv 2021
RegionCLIP: Region-based Language-Image Pretraining
CVPR 2022 1
CvT: Introducing Convolutions to Vision Transformers
ICCV 2021 10
Affiliations
Frequent co-authors
10 from 11 papers