Skip to main navigation Skip to search Skip to main content

Vision transformers: The threat of realistic adversarial patches

  • Vrije Universiteit Brussel
  • Open University of the Netherlands
  • Netherlands Defence Academy
  • Fraunhofer IOSB
  • IMEC

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The increasing reliance on machine learning systems has made their security a critical concern. Evasion attacks enable adversaries to manipulate the decision-making processes of AI systems, potentially causing security breaches or misclassification of targets. Vision Transformers (ViTs) have gained significant traction in modern machine learning due to increased 1) performance compared to Convolutional Neural Networks (CNNs) and 2) robustness against adversarial perturbations. However, ViTs remain vulnerable to evasion attacks, particularly to adversarial patches, unique patterns designed to manipulate AI classification systems. These vulnerabilities are investigated by designing realistic adversarial patches to cause misclassification in person vs. non-person classification tasks using the Creases Transformation (CT) technique, which adds subtle geometric distortions similar to those occurring naturally when wearing clothing. This study investigates the transferability of adversarial attack techniques used in CNNs when applied to ViT classification models. Experimental evaluation across four fine-tuned ViT models on a binary person classification task reveals significant vulnerability variations: attack success rates ranged from 40.04% (google/vit-base-patch16-224-in21k) to 99.97% (facebook/dino-vitb16), with google/vit-base-patch16-224 achieving 66.40% and facebook/dinov3-vitb16 reaching 65.17%. These results confirm the cross-architectural transferability of adversarial patches from CNNs to ViTs, with pre-training dataset scale and methodology strongly influencing model resilience to adversarial attacks.

Original languageEnglish
Title of host publicationArtificial Intelligence for Security and Defence Applications III
EditorsHugo J. Kuijf, Radhakrishna Prabhu, Yitzhak Yitzhaky
PublisherSociety of Photo-Optical Instrumentation Engineers
ISBN (Electronic)9781510692978
DOIs
Publication statusPublished - 28 Oct 2025
Event3rd Artificial Intelligence for Security and Defence Applications - Madrid, Spain
Duration: 16 Sept 202518 Sept 2025

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13679
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference3rd Artificial Intelligence for Security and Defence Applications
Country/TerritorySpain
CityMadrid
Period16/09/2518/09/25

Keywords

  • Adversarial Patches
  • Attack Transferability
  • Evasion Attacks
  • Vision Transformers

Fingerprint

Dive into the research topics of 'Vision transformers: The threat of realistic adversarial patches'. Together they form a unique fingerprint.

Cite this