0

Chengjie Wang

Papers
31

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
31papers

Authored papers

31

L2P: Unlocking Latent Potential for Pixel Generation

arXiv 2026

2026

AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection

arXiv 2025

2025

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation

arXiv 2025

2025

Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation

arXiv 2025

2025

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

arXiv 2025

2025

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10times

arXiv 2025

2025

StrandDesigner: Towards Practical Strand Generation with Sketch Guidance

arXiv 2025

2025

SVFR: A Unified Framework for Generalized Video Face Restoration

CVPR 2025 1

2025

DiP: Taming Diffusion Models in Pixel Space

arXiv 2025

2025

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network

CVPR 2025 1

2024

Efficient Multimodal Large Language Models: A Survey

arXiv 2024

2024

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection

arXiv 2024

2024

MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection

arXiv 2024

2024

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

arXiv 2024

2024

DF40: Toward Next-Generation Deepfake Detection

arXiv 2024

2024

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding

CVPR 2025 1

2024

Learning Multi-view Anomaly Detection

arXiv 2024

2024

A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection

arXiv 2024

2024

A Survey on Benchmarks of Multimodal Large Language Models

arXiv 2024

2024

Tuning-Free Image Customization with Image and Text Guidance

arXiv 2024

2024

EMOv2: Pushing 5M Vision Model Frontier

arXiv 2024

2024

CustAny: Customizing Anything from A Single Example

CVPR 2025 1

2024

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

arXiv 2024

2024

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

arXiv 2024

2024

Deep Industrial Image Anomaly Detection: A Survey

arXiv 2023

2023

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

ICCV 2023 1

2023

Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region

arXiv 2023

2023

Rethinking Mobile Block for Efficient Attention-based Models

ICCV 2023 1

2023

Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

ICCV 2023 1

2023

Surface Representation for Point Clouds

arXiv 2022

2022

How to Reduce Change Detection to Semantic Segmentation

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 31 papers