Cite
Notes
Only stored in your browser.
Attribution
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
arXiv 2023
SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning
CVPR 2022 1
UNITER: UNiversal Image-TExt Representation Learning
ECCV 2020 8
from 3 papers
Linjie Li
Kevin Lin
Lijuan Wang
Zhe Gan
Zicheng Liu
Ahmed El Kholy
Ce Liu
Chung-Ching Lin
Ehsan Azarnasab
JianFeng Wang