Robust Evaluation of Diffusion-Based Adversarial Purification

The paper critiques current evaluation practices for diffusion-based purification methods and proposes a new strategy for improved robustness against adversarial attacks.

Year
2023
Venue
ICCV 2023
Authors
2

Abstract & full text
arxiv.org/abs/2303.09051v3

Abstract

We question the current evaluation practice on diffusion-based purification methods. Diffusion-based purification methods aim to remove adversarial effects from an input data point at test time. The approach gains increasing attention as an alternative to adversarial training due to the disentangling between training and testing. Well-known white-box attacks are often employed to measure the robustness of the purification. However, it is unknown whether these attacks are the most effective for the diffusion-based purification since the attacks are often tailored for adversarial training. We analyze the current practices and provide a new guideline for measuring the robustness of purification methods against adversarial attacks. Based on our analysis, we further propose a new purification strategy improving robustness compared to the current diffusion-based purification methods.
