Bitwise neural network acceleration: opportunities and challenges

Michel van Lier, Luc Waeijen, Henk Corporaal

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review


Abstract

Real-time inference of deep convolutional neural networks (CNNs) on embedded systems and SoCs would enable many interesting applications. However, these CNNs are computation- and data-intensive, making it difficult to execute them in real time on energy-constrained embedded platforms. Recent research has shown that lightweight CNNs with quantized model weights and activations constrained to a single bit, {-1, +1}, can still achieve reasonable accuracy compared to the non-quantized 32-bit model. These binary neural networks (BNNs) theoretically allow a drastic reduction in energy and run time by reducing memory size, the number of memory accesses, and finally computation power, the last by replacing expensive two's complement arithmetic operations with more efficient bitwise versions. To exploit these advantages, we propose a bitwise CNN accelerator (BNNA) mapped on an FPGA. We implement the Hubara'16 network [1] on the Xilinx Zynq-7020 SoC. Massive parallelism is achieved by performing 4608 binary MACs in parallel in total, which enables real-time speeds of up to 110 fps while using only 22% of the FPGA LUTs. Compared to a 32-bit network, a speed-up of 32 times and a resource reduction of 40 times are achieved, with memory bandwidth being the main bottleneck. The detailed analysis of the carefully crafted accelerator design exposes the challenges and opportunities in bitwise neural network accelerator design.
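To illustrate what replacing two's complement arithmetic with bitwise operations can look like, here is a minimal Python sketch of a binary multiply-accumulate. It assumes the XNOR-plus-popcount formulation commonly used for BNNs; the abstract does not specify the BNNA datapath, so this is not a description of the authors' hardware. The helper names `pack_bits`, `binary_dot`, and `reference_dot` are hypothetical, chosen only for this example.

```python
def pack_bits(values):
    """Pack a sequence of +1/-1 values into an int; bit i is 1 when values[i] is +1."""
    word = 0
    for i, v in enumerate(values):
        if v == 1:
            word |= 1 << i
        elif v != -1:
            raise ValueError("values must be +1 or -1")
    return word


def binary_dot(a_word, b_word, n):
    """Dot product of two packed {-1, +1} vectors of length n via XNOR and popcount."""
    mask = (1 << n) - 1
    agree = ~(a_word ^ b_word) & mask   # XNOR: bit is 1 where the two signs match
    matches = bin(agree).count("1")     # popcount of the agreeing positions
    return 2 * matches - n              # (#matches) - (#mismatches)


def reference_dot(a, b):
    """Ordinary integer dot product, used only to check the bitwise version."""
    return sum(x * y for x, y in zip(a, b))


a = [1, -1, 1, 1, -1, -1, 1, -1]
b = [1, 1, -1, 1, -1, 1, 1, -1]
assert binary_dot(pack_bits(a), pack_bits(b), len(a)) == reference_dot(a, b)
print(binary_dot(pack_bits(a), pack_bits(b), len(a)))  # prints 2
```

Each product of two values in {-1, +1} is +1 when the signs agree and -1 otherwise, so the sum over n elements equals twice the number of agreeing positions minus n. In this formulation a multiplier per element is replaced by a single XNOR gate, and the accumulation becomes a bit count.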

Original language: English
Title of host publication: 2019 8th Mediterranean Conference on Embedded Computing, MECO 2019 - Proceedings
Editors: Radovan Stojanovic, Lech Jozwiak, Budimir Lutovac, Drazen Jurisic
Place of Publication: Piscataway
Publisher: Institute of Electrical and Electronics Engineers
Number of pages: 5
ISBN (Electronic): 978-1-7281-1740-9
DOIs
Publication status: Published - 1 Jun 2019
Event: 8th Mediterranean Conference on Embedded Computing, MECO 2019 - Budva, Montenegro
Duration: 10 Jun 2019 to 14 Jun 2019

Conference

Conference: 8th Mediterranean Conference on Embedded Computing, MECO 2019
Country: Montenegro
City: Budva
Period: 10/06/19 to 14/06/19
