Yinfei Yang

Papers
17

Authored papers

17

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

arXiv 2026

2026

MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs

arXiv 2025

2025

GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing

arXiv 2025

2025

Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

arXiv 2025

2025

PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection

arXiv 2025

2025

Multimodal Autoregressive Pre-training of Large Vision Encoders

CVPR 2025

2024

Improve Vision Language Model Chain-of-thought Reasoning

arXiv 2024

2024

MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs

arXiv 2024

2024

How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts

arXiv 2024

2024

Ferret: Refer and Ground Anything Anywhere at Any Granularity

arXiv 2023

2023

VeCLIP: Improving CLIP Training via Visual-enriched Captions

arXiv 2023

2023

MOFI: Learning Image Representations from Noisy Entity Annotated Images

arXiv 2023

2023

Guiding Instruction-based Image Editing via Multimodal Large Language Models

arXiv 2023

2023

Compressing LLMs: The Truth is Rarely Pure and Never Simple

arXiv 2023

2023

DocAsRef: An Empirical Study on Repurposing Reference-Based Summary Quality Metrics Reference-Freely

arXiv 2022

2022

MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models

arXiv 2020

2020

PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 17 papers