Skip to main navigation Skip to search Skip to main content

Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors

Raz Lapid, Almog Dubin, Moshe Sipper

Research output: Contribution to journalArticlepeer-review

Abstract

Adaptive adversarial attacks, where adversaries tailor their strategies with full knowledge of defense mechanisms, pose significant challenges to the robustness of adversarial detectors. In this paper, we introduce RADAR (Robust Adversarial Detection via Adversarial Retraining), an approach designed to fortify adversarial detectors against such adaptive attacks while preserving the classifier’s accuracy. RADAR employs adversarial training by incorporating adversarial examples—crafted to deceive both the classifier and the detector—into the training process. This dual optimization enables the detector to learn and adapt to sophisticated attack scenarios. Comprehensive experiments on CIFAR-10, SVHN, and ImageNet datasets demonstrate that RADAR substantially enhances the detector’s ability to accurately identify adaptive adversarial attacks without degrading classifier performance.

Original languageAmerican English
Article number3451
JournalMathematics
Volume12
Issue number22
DOIs
StatePublished - 1 Nov 2024

Keywords

  • adaptive adversarial attacks
  • adversarial attacks
  • deep learning
  • robustness

All Science Journal Classification (ASJC) codes

  • Computer Science (miscellaneous)
  • General Mathematics
  • Engineering (miscellaneous)

Fingerprint

Dive into the research topics of 'Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors'. Together they form a unique fingerprint.

Cite this