Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 3 von 110
2021 34th International Conference on VLSI Design and 2021 20th International Conference on Embedded Systems (VLSID), 2021, p.240-245
2021
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Demystifying Compression Techniques in CNNs: CPU, GPU and FPGA cross-platform analysis
Ist Teil von
  • 2021 34th International Conference on VLSI Design and 2021 20th International Conference on Embedded Systems (VLSID), 2021, p.240-245
Ort / Verlag
IEEE
Erscheinungsjahr
2021
Quelle
IEEE/IET Electronic Library (IEL)
Beschreibungen/Notizen
  • Convolutional Neural Networks (CNNs) are known for their high-performance despite its huge memory requirement and computational complexity. A wide range of compression techniques to reduce the number of parameters and hence computational and memory complexity have been exploring recently. In this paper, we analyse three widely used categories of techniques viz. quantization, pruning and tensor decomposition to make a cross-platform performance comparison on CPU, GPU and FPGA. These techniques are not mutually exclusive and hence can be combined to get better compression and a better speed-up on devices. Our focus is to highlight the contrasting impact of optimization techniques on devices and performance objectives. We observe a speed-up of 3.8 to 15.6× on CPU, 3.4 to 7.2× on GPU and 10.5 to 29.4×on FPGA across the models and compression techniques under consideration. We also achieved a compression of 93 to 97% across models with acceptable accuracy. Blended techniques have shown a better speed-up on FPGA compared to CPU and GPU as the caching effects, memory accesses and compiler optimizations slow down the inference on these general-purpose machines.
Sprache
Englisch
Identifikatoren
eISSN: 2380-6923
DOI: 10.1109/VLSID51830.2021.00046
Titel-ID: cdi_ieee_primary_9407445

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX