Xiyang Dai

Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

11papers

Authored papers

Efficient Modulation for Vision Networks

arXiv 2024

2024

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation

arXiv 2024

2024

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

CVPR 2024 1

2023

GLIPv2: Unifying Localization and Vision-Language Understanding

arXiv 2022

2022

Focal Modulation Networks

arXiv 2022

2022

Reduce Information Loss in Transformers for Pluralistic Image Inpainting

CVPR 2022 1

2022

Generalized Decoding for Pixel, Image, and Language

CVPR 2023 1

2022

RegionCLIP: Region-based Language-Image Pretraining

CVPR 2022 1

2021

CvT: Introducing Convolutions to Vision Transformers

ICCV 2021 10

2021

Dynamic Head: Unifying Object Detection Heads with Attentions

CVPR 2021 1

2021

Florence: A New Foundation Model for Computer Vision

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

from 11 papers

Lu Yuan

Bin Xiao

Jianfeng Gao

Jianwei Yang

Chunyuan Li

Dongdong Chen

Mengchen Liu

Lijuan Wang

Noel Codella

Pengchuan Zhang