Shijia Yang
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5CaptionQA: Is Your Caption as Useful as the Image Itself?
arXiv 2025
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
arXiv 2024
Law of Vision Representation in MLLMs
arXiv 2024
HallE-Control: Controlling Object Hallucination in Large Multimodal Models
arXiv 2023
Multitask Vision-Language Prompt Tuning
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers