UB Paderborn / Katalog / Suche / Details

Zur Ergebnisliste

Ergebnis 7 von 264

NTD: Non-Transferability Enabled Deep Learning Backdoor Detection

IEEE transactions on information forensics and security, 2024, Vol.19, p.104-119

2024

Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte

Titel

NTD: Non-Transferability Enabled Deep Learning Backdoor Detection

Ist Teil von

IEEE transactions on information forensics and security, 2024, Vol.19, p.104-119

Ort / Verlag

IEEE

Erscheinungsjahr

2024

Quelle

IEEE Xplore

Beschreibungen/Notizen

To mitigate recent insidious backdoor attacks on deep learning models, advances have been made by the research community. Nonetheless, state-of-the-art defenses are either limited to specific backdoor attacks (i.e., source-agnostic attacks) or non-user-friendly in that machine learning expertise and/or expensive computing resources are required. This work observes that all existing backdoor attacks have an inadvertent and inevitable intrinsic weakness, termed as non-transferability -that is, a trigger input hijacks a backdoored model but is not effective in another model that has not been implanted with the same backdoor. With this key observation, we propose non-transferability enabled backdoor detection to identify trigger inputs for a model-under-test during run-time. Specifically, our detection allows a potentially backdoored model-under-test to predict a label for an input. Moreover, our detection leverages a feature extractor to extract feature vectors for the input and a group of samples randomly picked from its predicted class label, and then compares the similarity between the input and the samples in the feature extractor's latent space to determine whether the input is a trigger input or a benign one. The feature extractor can be provided by a reputable party or is a free pre-trained model privately reserved from any open platform (e.g., ModelZoo, GitHub, Kaggle) by a user and thus our detection does not require the user to have any machine learning expertise or perform costly computations. Extensive experimental evaluations on four common tasks affirm that our detection scheme has high effectiveness (low false acceptance rate) and usability (low false rejection rate) with low detection latency against different types of backdoor attacks.

Sprache: Englisch
Identifikatoren: ISSN: 1556-6013
eISSN: 1556-6021
DOI: 10.1109/TIFS.2023.3312973
Titel-ID: cdi_ieee_primary_10243095

Format: –
Schlagworte: Backdoor countermeasure, Computational modeling, deep learning, Face recognition, Feature extraction, Iron, non-transferability, NTD, Speech recognition, Task analysis, Training

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX

Menü

Weitere Dienste

Einstellungen

NTD: Non-Transferability Enabled Deep Learning Backdoor Detection

Details

Weiterführende Literatur