Details

Author(s) / Contributors
Title
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Is part of
  • Computer Vision – ECCV 2018, pp. 373-390
Place / Publisher
Cham: Springer International Publishing
Source
Alma/SFX Local Collection
Descriptions / Notes
  • Although weight and activation quantization is an effective approach for Deep Neural Network (DNN) compression and has great potential to increase inference speed by leveraging bit operations, there is still a noticeable gap in prediction accuracy between the quantized model and the full-precision model. To address this gap, we propose to jointly train a quantized, bit-operation-compatible DNN and its associated quantizers, as opposed to using fixed, handcrafted quantization schemes such as uniform or logarithmic quantization. Our method for learning the quantizers applies to both network weights and activations with arbitrary-bit precision, and our quantizers are easy to train. Comprehensive experiments on the CIFAR-10 and ImageNet datasets show that our method works consistently well for various network structures such as AlexNet, VGG-Net, GoogLeNet, ResNet, and DenseNet, surpassing previous quantization methods in accuracy by an appreciable margin. Code available at https://github.com/Microsoft/LQ-Nets.
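
    The abstract leaves the quantizer-learning procedure implicit. The sketch below is a minimal, hypothetical NumPy illustration of one way a learnable quantizer of the kind described can be fit: quantization levels are formed as inner products of binary codes with a small learnable basis, and the basis is refit in closed form by least squares to shrink the quantization error (alternating minimization). All names here (make_levels, quantize, update_basis) and the toy data are illustrative assumptions, not the authors' code; their implementation is at the GitHub URL above.

    import itertools
    import numpy as np

    def make_levels(basis):
        # All 2^K quantization levels: inner products of the learnable
        # basis with every binary code in {-1, +1}^K.
        codes = np.array(list(itertools.product([-1.0, 1.0], repeat=len(basis))))
        return codes, codes @ basis

    def quantize(x, basis):
        # Assign each value to its nearest learned level; return the
        # quantized values and the binary codes that produced them.
        codes, levels = make_levels(basis)
        idx = np.abs(x[..., None] - levels).argmin(axis=-1)
        return levels[idx], codes[idx]

    def update_basis(x, codes):
        # Closed-form least-squares refit of the basis given fixed codes:
        # minimizes ||codes @ basis - x||^2.
        B = codes.reshape(-1, codes.shape[-1])
        sol, *_ = np.linalg.lstsq(B, x.ravel(), rcond=None)
        return sol

    # Toy demonstration on synthetic "weights" (K = 2, i.e. a 2-bit quantizer).
    rng = np.random.default_rng(0)
    w = rng.normal(size=10000)
    basis = np.array([0.5, 0.25])
    for step in range(10):
        q, codes = quantize(w, basis)
        basis = update_basis(w, codes)
        print(step, np.mean((w - q) ** 2))  # quantization error is non-increasing

    In the paper's setting such an alternation would run inside network training, presumably combined with a straight-through gradient estimator as is common for quantized training; the sketch only isolates the quantizer-fitting step.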
Language
English
Identifiers
ISBN: 9783030012366, 3030012360
ISSN: 0302-9743
eISSN: 1611-3349
DOI: 10.1007/978-3-030-01237-3_23
Title ID: cdi_springer_books_10_1007_978_3_030_01237_3_23
