UB Paderborn / Katalog / Suche / Details

Ergebnis 23 von 59

2021 18th International SoC Design Conference (ISOCC), 2021, p.413-414

2021

Autor(en) / Beteiligte

Titel

Understanding and Reducing Weight-Load Overhead of Systolic Deep Learning Accelerators

Ist Teil von

Ort / Verlag

IEEE

Erscheinungsjahr

2021

Link zum Volltext

Quelle

IEEE/IET Electronic Library

Beschreibungen/Notizen

As an energy-efficient computing engine for deep neural network inference, 2D systolic array architectures have been widely adopted in modern deep learning accelerators. However, despite high compute density and energy-efficient data passing, systolic accelerators suffer a non-trivial overhead of loading data stationed inside their local register file. This loading overhead becomes a critical issue when a frequent reload of stationary data (e.g., weight parameters) is required. This paper proposes a simple yet practical SW-HW co-optimization that reverses the weight-load order and adds a dedicated path for weight-load. On diverse deep learning applications, the proposed method reduces the weight-load overhead and achieves up to 1.8× speedup with 40% energy savings.

Sprache: Englisch
Identifikatoren: DOI: 10.1109/ISOCC53507.2021.9613929
Titel-ID: cdi_ieee_primary_9613929

Format: –
Schlagworte: accelerator, Computer architecture, Deep learning, deep neural network, Engines, Loading, Registers, systolic array, Systolic arrays

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX