Shijia Yang

Cite

Notes

Only stored in your browser.

Attribution

5papers

Authored papers

CaptionQA: Is Your Caption as Useful as the Image Itself?

arXiv 2025

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

arXiv 2024

Law of Vision Representation in MLLMs

arXiv 2024

HallE-Control: Controlling Object Hallucination in Large Multimodal Models

arXiv 2023

Multitask Vision-Language Prompt Tuning

arXiv 2022

No known affiliations.

from 5 papers

Bohan Zhai

Chenfeng Xu

Kurt Keutzer

Manling Li

Sheng Shen

Benyou Wang

Bohao Li

Chunyuan Li

Emad Barsoum

Hongxia Yang