Xiaokang Yang

Authored papers

24

CodePercept: Code-Grounded Visual STEM Perception for MLLMs

arXiv 2026

2026

RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation

arXiv 2025

2025

Low-bit Model Quantization for Deep Neural Networks: A Survey

arXiv 2025

2025

Dens3R: A Foundation Model for 3D Geometry Prediction

arXiv 2025

2025

One-Step Diffusion Model for Image Motion-Deblurring

arXiv 2025

2025

Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding

CVPR 2025

2025

Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography

ICCV 2025

2025

A Token-level Text Image Foundation Model for Document Understanding

ICCV 2025

2025

MM-ACT: Learn from Multimodal Parallel Generation to Act

arXiv 2025

2025

FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing

arXiv 2025

2025

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

arXiv 2024

2024

OSDFace: One-Step Diffusion Model for Face Restoration

CVPR 2025

2024

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark

arXiv 2024

2024

Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis

arXiv 2024

2024

PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer

arXiv 2024

2024

PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing

arXiv 2024

2024

UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment

arXiv 2024

2024

FLoRA: Low-Rank Core Space for N-dimension

arXiv 2024

2024

MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations

arXiv 2024

2024

VidToMe: Video Token Merging for Zero-Shot Video Editing

CVPR 2024

2023

Dual Aggregation Transformer for Image Super-Resolution

ICCV 2023

2023

Recursive Generalization Transformer for Image Super-Resolution

arXiv 2023

2023

Image Super-Resolution with Text Prompt Diffusion

arXiv 2023

2023

Model-Based Reinforcement Learning with Multi-Task Offline Pretraining

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 24 papers