Yinfei Yang
- Papers
- 17
Cite
Notes
Only stored in your browser.
Authored papers
17Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants
arXiv 2026
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
arXiv 2025
GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
arXiv 2025
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
arXiv 2025
PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection
arXiv 2025
Multimodal Autoregressive Pre-training of Large Vision Encoders
CVPR 2025 1
Improve Vision Language Model Chain-of-thought Reasoning
arXiv 2024
MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
arXiv 2024
How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts
arXiv 2024
Ferret: Refer and Ground Anything Anywhere at Any Granularity
arXiv 2023
VeCLIP: Improving CLIP Training via Visual-enriched Captions
arXiv 2023
MOFI: Learning Image Representations from Noisy Entity Annotated Images
arXiv 2023
Guiding Instruction-based Image Editing via Multimodal Large Language Models
arXiv 2023
Compressing LLMs: The Truth is Rarely Pure and Never Simple
arXiv 2023
DocAsRef: An Empirical Study on Repurposing Reference-Based Summary Quality Metrics Reference-Freely
arXiv 2022
MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models
arXiv 2020
PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification
paws-x-a-cross-lingual-adversarial-dataset-1
Affiliations
Frequent co-authors
10from 17 papers