Xu Yang
- Papers
- 17
Cite
Notes
Only stored in your browser.
Authored papers
17GEditBench v2: A Human-Aligned Benchmark for General Image Editing
arXiv 2026
Covering Human Action Space for Computer Use: Data Synthesis and Benchmark
arXiv 2026
Mimic In-Context Learning for Multimodal Tasks
CVPR 2025 1
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models
arXiv 2025
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
arXiv 2025
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
arXiv 2025
d^2Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching
arXiv 2025
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
arXiv 2025
LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers
arXiv 2025
FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks
arXiv 2025
Speculative Ensemble: Fast Large Language Model Ensemble via Speculation
arXiv 2025
Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On
CVPR 2024 1
Number it: Temporal Grounding Videos like Flipping Manga
CVPR 2025 1
Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient
arXiv 2024
Transformer as Linear Expansion of Learngene
arXiv 2023
Exploring Diverse In-Context Configurations for Image Captioning
NeurIPS 2023 11
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
arXiv 2022
Affiliations
Frequent co-authors
10from 17 papers