Chengjie Wang
- Papers
- 31
Authored papers
31
L2P: Unlocking Latent Potential for Pixel Generation
arXiv 2026
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
arXiv 2025
Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
arXiv 2025
Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation
arXiv 2025
Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
arXiv 2025
Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10×
arXiv 2025
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance
arXiv 2025
SVFR: A Unified Framework for Generalized Video Face Restoration
CVPR 2025
DiP: Taming Diffusion Models in Pixel Space
arXiv 2025
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
CVPR 2025
Efficient Multimodal Large Language Models: A Survey
arXiv 2024
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
arXiv 2024
MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection
arXiv 2024
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
arXiv 2024
DF40: Toward Next-Generation Deepfake Detection
arXiv 2024
VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding
CVPR 2025
Learning Multi-view Anomaly Detection
arXiv 2024
A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection
arXiv 2024
A Survey on Benchmarks of Multimodal Large Language Models
arXiv 2024
Tuning-Free Image Customization with Image and Text Guidance
arXiv 2024
EMOv2: Pushing 5M Vision Model Frontier
arXiv 2024
CustAny: Customizing Anything from A Single Example
CVPR 2025
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description
arXiv 2024
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
arXiv 2024
Deep Industrial Image Anomaly Detection: A Survey
arXiv 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
ICCV 2023
Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region
arXiv 2023
Rethinking Mobile Block for Efficient Attention-based Models
ICCV 2023
Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption
ICCV 2023
Surface Representation for Point Clouds
arXiv 2022
How to Reduce Change Detection to Semantic Segmentation
arXiv 2022
Affiliations
Frequent co-authors
10 from 31 papers