Cite
Notes
Only stored in your browser.
Attribution
Benchmarking and Improving Detail Image Caption
arXiv 2024
Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering
from 3 papers
Haoyuan Guo
Jiacong Wang
Xin Xiao
Xun Zhou
Chunyuan Li
Haiyong Jiang
Hongyuan Dong
Jiawen Li
Jun Xiao
Yuan Zhang