Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 2D4-GS-2-04
Conference information

Performance evaluation of GPU-based CNN for object detection on SoC system
*Takuya UESUGIMasato GOCHOYuta KAWAKAMIHiroshi SAKAMAKI
Author information
Keywords: CNN, SoC, GPU
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

CNN (Convolutional Neural Network), which is used for image recognition and object detection and has recently been implemented in the System on Chip (SoC) environment, has a large amount of computation in the convolution layer, so performance may be degraded in the SoC environment. In this research, as an investigation of speeding up Convolution calculations for GPUs on SoC, an algorithm that solves Convolution operation by matrix multiplication was implemented with OpenCL, and the processing time of the object detection algorithm YOLO-Nano was measured on Intel and Qualcomm SoCs. As a result, compared to the Tensorflow Lite CPU, Qualcomm and Intel CPUs achieved speed improvement effects of 1.04 times and 6.37 times, respectively, and GPUs achieved speed improvement effects of 1.21 times and 1.52 times, respectively.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top