Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 14 von 173
IEEE transactions on computer-aided design of integrated circuits and systems, 2022-09, Vol.41 (9), p.2957-2969
2022
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
NC-Net: Efficient Neuromorphic Computing Using Aggregated Subnets on a Crossbar-Based Architecture With Nonvolatile Memory
Ist Teil von
  • IEEE transactions on computer-aided design of integrated circuits and systems, 2022-09, Vol.41 (9), p.2957-2969
Ort / Verlag
New York: IEEE
Erscheinungsjahr
2022
Quelle
IEEE Electronic Library (IEL)
Beschreibungen/Notizen
  • Neuromorphic computing chips consisting of crossbar arrays of emergent nonvolatile memory (NVM) have the potential of achieving both high energy efficiency and throughput as the low-power implementation of convolutional neural network (CNN) inference engines. However, such hardware has design constraints, such as its limited fan-in/fan-out and resource-inefficient mapping, that make the design and deployment of CNN on them challenging. As a result, the user has to design the CNN model with intricate knowledge of the hardware architecture and even cannot fit the models in the hardware for CNN with high resolution image input. In this article, we propose the use of aggregated subnets , NC-net, which is a constrained form of the traditional layer structure, to solve these issues. With our method, we put forward an energy-efficient buffer- and analogue-to-digital converter and digital-to-analogue converter (ADC/DAC)-free architecture and a scalable end-to-end solution that automatically satisfies the hardware constraints of crossbar architectures, while optimizing the resource usage. In our solution, the exploration and deployment of a CNN for a neuromorphic crossbar hardware start with a design front end based on TensorFlow . Our automated design flow maps the NC-net network from TensorFlow to the crossbar architecture. We tested our designs on both a simulator and a field-programmable gate array (FPGA) emulator with various benchmarks. In addition to general benchmarks, including MNIST, SVHN, CIFAR-10, and CIFAR-100, we tested our system on a real-world application, human detection with high resolution (224 <inline-formula> <tex-math notation="LaTeX">\times </tex-math></inline-formula> 224) images as the input. Our system achieves the state-of-the-art accuracy for these benchmarks on the crossbar-based neuromorphic hardware, with an accuracy of more than 90% for the latter. It also yielded up to <inline-formula> <tex-math notation="LaTeX">4.25\times </tex-math></inline-formula> improvement in the efficiency of spiking core usage compared to TrueNorth.

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX