Cite
Notes
Only stored in your browser.
Attribution
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning
arXiv 2024
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
arXiv 2023
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
from 3 papers
Haoxuan You
Kai-Wei Chang
Rui Sun
Shih-Fu Chang
Adams Yu
Garrett Bingham
Gengyu Wang
Golnaz Ghiasi
Hammad A. Ayyubi
Long Chen