Efficient Hardware Accelerator for Compressed Sparse Deep Neural Network

Hao XIAO; Kaikai ZHAO; Guangzhu LIU

doi:10.1587/transinf.2020EDL8153

抄録

This work presents a DNN accelerator architecture specifically designed for performing efficient inference on compressed and sparse DNN models. Leveraging the data sparsity, a runtime processing scheme is proposed to deal with the encoded weights and activations directly in the compressed domain without decompressing. Furthermore, a new data flow is proposed to facilitate the reusage of input activations across the fully-connected (FC) layers. The proposed design is implemented and verified using the Xilinx Virtex-7 FPGA. Experimental results show it achieves 1.99×, 1.95× faster and 20.38×, 3.04× more energy efficient than CPU and mGPU platforms, respectively, running AlexNet.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

Beclomethasone dipropionate (BDP) 吸入療法施行中の喘息患児における副腎皮質機能の検討
Sato's conjecture on recurrence conditions for multidimensional processes of Ornstein-Uhlenbeck type
Fossil whale barnacles (Cirripedia: Thoracica: Coronuloidea) of Japan
CIRCADIAN VARIATION IN CIRCULATING BLOOD VOLUME
Influence of Heat Treatment on the Microstructure and Magnetic Properties of Mn–Sn–Co–N Alloys

発行機関からのお知らせ

PPV is available from https://globals.ieice.org/en_transactions/information

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）