IEICE Electronics Express
Online ISSN : 1349-2543
ISSN-L : 1349-2543

This article has now been updated. Please use the final version.

Design and Implementation of an Efficient CNN Accelerator for Low-Cost FPGAs
Yan XuShuaishuai WangNing LiHao Xiao
Author information
JOURNAL FREE ACCESS Advance online publication

Article ID: 19.20220370

Details
Abstract

This paper proposes a computation-array-centered dataflow, which adjusts the convolution with different kernel sizes to a unified computing manner and reduces the dimension of computation array from 2D to 1D, so as to maximize the utilization of the computation elements offered by the accelerator. Furthermore, a single unit multiple data (SUMD) strategy is proposed to effectively alleviate the mismatch between the quantized data and the hardware resources with fixed bit width on FPGA. As a case study, an 8-bit MobileNetV2 model has been implemented on the low-cost ZYNQ XC7Z020 FPGA, whose FPS/DSP and GOPS/DSP achieve upto 0.55 and 0.35 respectively.

Content from these authors
© 2022 by The Institute of Electronics, Information and Communication Engineers
feedback
Top