Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
35th (2021)
Session ID : 3I4-GS-7a-02
Conference information

Improvement of object detection inference speed and accuracy in edge environment
*Yoshiyuki KONISHIDaisuke KUNIMUNEYoshitaka NISHIDA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In recent years, the demand for edge AI has been expanding from the viewpoint of real-time performance and data confidentiality. We use the QAT (Quantization Aware Training) method of TensorFlow and TensorFlow Lite to realize speed up and memory saving in edge AI. In the current situation where new AI models are being devised one after another, it is unlikely that the QAT will support all operations. Therefore, depending on the AI model used, there is a problem that speed and accuracy will decrease due to the inclusion of unsupported operations. In this paper, we will take YOLOv3-tiny, an object detection model in which such a problem occurs, as an example to propose methods for improving speed and accuracy. We were able to half the inference time on the Raspberry Pi 3 Model B+ and improve the inference accuracy to the same level as before quantization.

Content from these authors
© 2021 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top