ASPoT8: An Efficient 8-bit Quantization Balancing Hardware and Accuracy for Object Detection

Hui Li; Xiaofeng Yang; Zebin Zheng; Jinyi Li; Shengli Lu

doi:10.1587/transinf.2024EDL8089

抄録

Hardware accelerators using fixed-point quantization efficiently run object detection neural networks, but high-bit quantization demands substantial hardware and power, while low-bit quantization sacrifices accuracy. To address this, we introduce an 8-bit quantization scheme, ASPoT8, which uses add/shift operations to replace INT8 multiplications, minimizing hardware area and power consumption without compromising accuracy. ASPoT8 adjusts quantified value distribution to match INT8's accuracy. Tests on YOLOV3 Tiny and MobileNetV2 SSDlite show minimal mAP drops of 0.5% and 0.2%, respectively, with significant reductions in power (76.31%), delay (29.46%), and area (58.40%) over INT8, based on SMIC 40nm.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

Effect of Fluid and Antibiotic Administration on Experimental Fecal Peritonitis
ヒヨドリHypsipetes amaurotis の鳴き出し時間のバラツキ比較（大磯、仙台、密陽）と鳴き出し時刻の照度推定
[title in Japanese]
[title in Japanese]
目次

発行機関からのお知らせ

PPV is available from https://globals.ieice.org/en_transactions/information

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）