Cite
Notes
Only stored in your browser.
Attribution
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
arXiv 2024
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
arXiv 2023
from 2 papers
Shuhuai Ren
Xu sun
Jianhong Bai
Kaikai An
Lei LI
Linli Yao
Lu Hou
Qingyan Guo
Shicheng Li
Yi Liu