Yushi Hu
Authored papers (10)
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning
arXiv 2026
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset
arXiv 2025
Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image
arXiv 2025
MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation
arXiv 2025
Training Language Models to Generate Text with Citations via Fine-grained Rewards
arXiv 2024
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
ICCV 2023
One Embedder, Any Task: Instruction-Finetuned Text Embeddings
arXiv 2022
PromptCap: Prompt-Guided Task-Aware Image Captioning
arXiv 2022
In-Context Learning for Few-Shot Dialogue State Tracking
arXiv 2022
Binding Language Models in Symbolic Languages
arXiv 2022
Frequent co-authors (10, from 10 papers)
Noah A. Smith
Mari Ostendorf
Luke Zettlemoyer (professor)
Ranjay Krishna
Tao Yu (professor)
Benlin Liu
Caiming Xiong (researcher)
Jungo Kasai
Tianbao Xie (grad student)
Weijia Shi (researcher)