Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 7 von 264
IEEE transactions on information forensics and security, 2024, Vol.19, p.104-119
2024
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
NTD: Non-Transferability Enabled Deep Learning Backdoor Detection
Ist Teil von
  • IEEE transactions on information forensics and security, 2024, Vol.19, p.104-119
Ort / Verlag
IEEE
Erscheinungsjahr
2024
Quelle
IEEE Xplore
Beschreibungen/Notizen
  • To mitigate recent insidious backdoor attacks on deep learning models, advances have been made by the research community. Nonetheless, state-of-the-art defenses are either limited to specific backdoor attacks (i.e., source-agnostic attacks) or non-user-friendly in that machine learning expertise and/or expensive computing resources are required. This work observes that all existing backdoor attacks have an inadvertent and inevitable intrinsic weakness, termed as non-transferability -that is, a trigger input hijacks a backdoored model but is not effective in another model that has not been implanted with the same backdoor. With this key observation, we propose non-transferability enabled backdoor detection to identify trigger inputs for a model-under-test during run-time. Specifically, our detection allows a potentially backdoored model-under-test to predict a label for an input. Moreover, our detection leverages a feature extractor to extract feature vectors for the input and a group of samples randomly picked from its predicted class label, and then compares the similarity between the input and the samples in the feature extractor's latent space to determine whether the input is a trigger input or a benign one. The feature extractor can be provided by a reputable party or is a free pre-trained model privately reserved from any open platform (e.g., ModelZoo, GitHub, Kaggle) by a user and thus our detection does not require the user to have any machine learning expertise or perform costly computations. Extensive experimental evaluations on four common tasks affirm that our detection scheme has high effectiveness (low false acceptance rate) and usability (low false rejection rate) with low detection latency against different types of backdoor attacks.
Sprache
Englisch
Identifikatoren
ISSN: 1556-6013
eISSN: 1556-6021
DOI: 10.1109/TIFS.2023.3312973
Titel-ID: cdi_ieee_primary_10243095

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX