Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 12 von 37822
IEEE transactions on pattern analysis and machine intelligence, 2022-08, Vol.44 (8), p.4388-4403
2022

Details

Autor(en) / Beteiligte
Titel
Self-Distillation: Towards Efficient and Compact Neural Networks
Ist Teil von
  • IEEE transactions on pattern analysis and machine intelligence, 2022-08, Vol.44 (8), p.4388-4403
Ort / Verlag
United States: IEEE
Erscheinungsjahr
2022
Link zum Volltext
Quelle
IEEE Xplore
Beschreibungen/Notizen
  • Remarkable achievements have been obtained by deep neural networks in the last several years. However, the breakthrough in neural networks accuracy is always accompanied by explosive growth of computation and parameters, which leads to a severe limitation of model deployment. In this paper, we propose a novel knowledge distillation technique named self-distillation to address this problem. Self-distillation attaches several attention modules and shallow classifiers at different depths of neural networks and distills knowledge from the deepest classifier to the shallower classifiers. Different from the conventional knowledge distillation methods where the knowledge of the teacher model is transferred to another student model, self-distillation can be considered as knowledge transfer in the same model - from the deeper layers to the shallow layers. Moreover, the additional classifiers in self-distillation allow the neural network to work in a dynamic manner, which leads to a much higher acceleration. Experiments demonstrate that self-distillation has consistent and significant effectiveness on various neural networks and datasets. On average, 3.49 and 2.32 percent accuracy boost are observed on CIFAR100 and ImageNet. Besides, experiments show that self-distillation can be combined with other model compression methods, including knowledge distillation, pruning and lightweight model design.
Sprache
Englisch
Identifikatoren
ISSN: 0162-8828
eISSN: 1939-3539
DOI: 10.1109/TPAMI.2021.3067100
Titel-ID: cdi_ieee_primary_9381661

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX