Yan Yan

Papers: 13

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

13papers

Authored papers

MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs

arXiv 2025

2025

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction

arXiv 2025

2025

LLM Inference Unveiled: Survey and Roofline Model Insights

arXiv 2024

2024

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning

ICCV 2025

2024

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

ICCV 2025

2024

Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention

arXiv 2024

2024

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding

arXiv 2024

2024

Boundary Guided Learning-Free Semantic Control with Diffusion Models

boundary-guided-learning-free-semantic

2023

Towards Saner Deep Image Registration

ICCV 2023 1

2023

Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation

arXiv 2022

2022

Post-training Quantization on Diffusion Models

CVPR 2023 1

2022

Quantized GAN for Complex Music Generation from Dance Videos

arXiv 2022

2022

Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

Weitai Kang

Yuzhang Shang

Mubarak Shah

Ye Zhu

Yu Wu

Zhihang Yuan

Bingzhe Wu

Kyle Olszewski

Sergey Tulyakov

Ali Payani