Host: The Japanese Society for Artificial Intelligence
Name : The 35th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 35
Location : [in Japanese]
Date : June 08, 2021 - June 11, 2021
In recent years, demand for edge AI has been growing from the viewpoints of real-time performance and data confidentiality. We use the QAT (Quantization Aware Training) method of TensorFlow and TensorFlow Lite to achieve faster inference and lower memory usage in edge AI. Given that new AI models are devised one after another, it is unlikely that QAT will support all operations, so depending on the AI model used, unsupported operations can degrade both speed and accuracy. In this paper, we take YOLOv3-tiny, an object detection model in which this problem occurs, as an example and propose methods for improving speed and accuracy. We were able to halve the inference time on the Raspberry Pi 3 Model B+ and improve the inference accuracy to the same level as before quantization.
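To illustrate why quantization saves memory and time on edge devices, the following sketch shows 8-bit affine quantization in the style TensorFlow Lite uses (real_value = scale * (quantized_value - zero_point)). This is a self-contained, illustrative example, not the paper's code or the TensorFlow Lite implementation; the function names and the per-tensor (rather than per-channel) scheme are simplifying assumptions.

```python
# Illustrative sketch (not the paper's code): per-tensor 8-bit affine
# quantization in the TFLite style, real = scale * (q - zero_point).
# Storing int8 instead of float32 cuts weight memory roughly 4x.

def quantize(values, num_bits=8):
    """Map floats to signed ints with a per-tensor scale and zero point."""
    qmin, qmax = -(2 ** (num_bits - 1)), 2 ** (num_bits - 1) - 1
    # The representable range must include 0.0 so zero maps exactly.
    lo, hi = min(min(values), 0.0), max(max(values), 0.0)
    scale = (hi - lo) / (qmax - qmin) or 1.0  # avoid divide-by-zero
    zero_point = round(qmin - lo / scale)
    q = [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate floats from quantized values."""
    return [scale * (v - zero_point) for v in q]

weights = [-1.5, -0.2, 0.0, 0.7, 1.5]
q, scale, zp = quantize(weights)
approx = dequantize(q, scale, zp)
```

Each dequantized weight differs from the original by at most one quantization step (the scale), which is the rounding error that QAT lets the network adapt to during training.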