Details

Author(s) / Contributors
Title
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Is part of
  • Computer Vision – ECCV 2018, pp. 373-390
Place / Publisher
Cham: Springer International Publishing
Source
Alma/SFX Local Collection
Descriptions / Notes
  • Although weight and activation quantization is an effective approach for Deep Neural Network (DNN) compression and has great potential to increase inference speed by leveraging bit operations, there is still a noticeable gap in prediction accuracy between the quantized model and the full-precision model. To address this gap, we propose to jointly train a quantized, bit-operation-compatible DNN and its associated quantizers, as opposed to using fixed, handcrafted quantization schemes such as uniform or logarithmic quantization. Our method for learning the quantizers applies to both network weights and activations with arbitrary-bit precision, and our quantizers are easy to train. Comprehensive experiments on the CIFAR-10 and ImageNet datasets show that our method works consistently well for various network structures such as AlexNet, VGG-Net, GoogLeNet, ResNet, and DenseNet, surpassing previous quantization methods in accuracy by an appreciable margin. Code available at https://github.com/Microsoft/LQ-Nets.
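
    The abstract leaves the quantizer-learning procedure implicit. The sketch below is a minimal, hypothetical NumPy illustration of one way a learnable quantizer of the kind described can be fit: quantization levels are formed as inner products of binary codes with a small learnable basis, and the basis is refit in closed form by least squares to shrink the quantization error (alternating minimization). All names here (make_levels, quantize, update_basis) and the toy data are illustrative assumptions, not the authors' code; their implementation is at the GitHub URL above.

    import itertools
    import numpy as np

    def make_levels(basis):
        # All 2^K quantization levels: inner products of the learnable
        # basis with every binary code in {-1, +1}^K.
        codes = np.array(list(itertools.product([-1.0, 1.0], repeat=len(basis))))
        return codes, codes @ basis

    def quantize(x, basis):
        # Assign each value to its nearest learned level; return the
        # quantized values and the binary codes that produced them.
        codes, levels = make_levels(basis)
        idx = np.abs(x[..., None] - levels).argmin(axis=-1)
        return levels[idx], codes[idx]

    def update_basis(x, codes):
        # Closed-form least-squares refit of the basis given fixed codes:
        # minimizes ||codes @ basis - x||^2.
        B = codes.reshape(-1, codes.shape[-1])
        sol, *_ = np.linalg.lstsq(B, x.ravel(), rcond=None)
        return sol

    # Toy demonstration on synthetic "weights" (K = 2, i.e. a 2-bit quantizer).
    rng = np.random.default_rng(0)
    w = rng.normal(size=10000)
    basis = np.array([0.5, 0.25])
    for step in range(10):
        q, codes = quantize(w, basis)
        basis = update_basis(w, codes)
        print(step, np.mean((w - q) ** 2))  # quantization error is non-increasing

    In the paper's setting such an alternation would run inside network training, presumably combined with a straight-through gradient estimator as is common for quantized training; the sketch only isolates the quantizer-fitting step.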
Language
English
Identifiers
ISBN: 9783030012366, 3030012360
ISSN: 0302-9743
eISSN: 1611-3349
DOI: 10.1007/978-3-030-01237-3_23
Title ID: cdi_springer_books_10_1007_978_3_030_01237_3_23
