Xiaogang Wang
- Papers: 12
Authored papers (12)
Language-based Trial and Error Falls Behind in the Era of Experience
arXiv 2026
Phased Consistency Models
arXiv 2024
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
arXiv 2023
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
arXiv 2023
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
arXiv 2023
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
CVPR 2023
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
CVPR 2023
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
CVPR 2023
Demystify Transformers & Convolutions in Modern Image Deep Networks
arXiv 2022
Deformable DETR: Deformable Transformers for End-to-End Object Detection
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
arXiv 2017
Spatial As Deep: Spatial CNN for Traffic Scene Understanding
arXiv 2017
Frequent co-authors
10 from 12 papers