Evading Adversarial Example Detection Defenses with Orthogonal Projected Gradient Descent

Oliver Bryniarski, Nabeel Hingun, Pedro Pachuca, Vincent Wang, Nicholas Carlini

2021

Evading adversarial example detection defenses requires finding adversarial examples that simultaneously (a) are misclassified by the model and (b) are classified as non-adversarial by the detector. We find that existing attacks that attempt to satisfy multiple simultaneous constraints often over-optimize against one constraint at the cost of satisfying another. We introduce Orthogonal Projected Gradient Descent, an improved attack technique for generating adversarial examples that avoids this problem by orthogonalizing the gradients when running standard gradient-based attacks. We use our technique to evade four state-of-the-art detection defenses, reducing their accuracy to 0% while maintaining a 0% detection rate.
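
The abstract does not spell out the update rule, so the following is only a minimal sketch of one way gradient orthogonalization could look, not the authors' implementation. It assumes the gradients of a classifier loss and a detector loss are already available as NumPy arrays; the function names (`orthogonalize`, `orthogonal_step`), the normalized step, and the L-infinity box with radius `radius` are all hypothetical choices made for illustration.

```python
import numpy as np


def orthogonalize(g, h, eps=1e-12):
    """Return the component of g that is orthogonal to h.

    g, h: gradient arrays of the same shape (assumed inputs).
    If h is (numerically) zero there is nothing to remove, so g is returned.
    """
    g_flat = g.ravel()
    h_flat = h.ravel()
    denom = np.dot(h_flat, h_flat)
    if denom < eps:
        return g.copy()
    # Subtract the projection of g onto h.
    projected = g_flat - (np.dot(g_flat, h_flat) / denom) * h_flat
    return projected.reshape(g.shape)


def orthogonal_step(x, x_orig, grad_cls, grad_det, step_size, radius):
    """One illustrative update (hypothetical helper, not from the paper).

    Moves x along the part of the classifier gradient that is orthogonal
    to the detector gradient, then clips back into an L-infinity box of
    the given radius around x_orig and into the valid range [0, 1].
    Clipping can break exact orthogonality; this is a simplification.
    """
    direction = orthogonalize(grad_cls, grad_det)
    norm = np.linalg.norm(direction)
    if norm > 0:
        direction = direction / norm
    x_new = x + step_size * direction
    x_new = np.clip(x_new, x_orig - radius, x_orig + radius)
    return np.clip(x_new, 0.0, 1.0)


if __name__ == "__main__":
    g = np.array([3.0, 4.0])   # stand-in classifier-loss gradient
    h = np.array([1.0, 0.0])   # stand-in detector-loss gradient
    print(orthogonalize(g, h))
    x0 = np.array([0.5, 0.5])
    print(orthogonal_step(x0, x0, g, h, step_size=0.1, radius=0.05))
```

The intent of the projection is that, to first order, a step in the orthogonalized direction changes the classifier loss without changing the detector loss, which is one reading of how orthogonalizing gradients could avoid over-optimizing one constraint at the expense of the other.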

Keywords:

Computer science
Gradient descent
Adversarial system
Detection rate
Constraint
Mathematical optimization
