Yan Yan
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction
arXiv 2025
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
arXiv 2025
LLM Inference Unveiled: Survey and Roofline Model Insights
arXiv 2024
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
arXiv 2024
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning
ICCV 2025
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
ICCV 2025
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
arXiv 2024
Boundary Guided Learning-Free Semantic Control with Diffusion Models
boundary-guided-learning-free-semantic
Towards Saner Deep Image Registration
ICCV 2023 1
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
arXiv 2022
Post-training Quantization on Diffusion Models
CVPR 2023 1
Quantized GAN for Complex Music Generation from Dance Videos
arXiv 2022
Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos
arXiv 2021
Affiliations
Frequent co-authors
10from 13 papers