IEICE Electronics Express
Online ISSN : 1349-2543
ISSN-L : 1349-2543


A Proposal for Enhancing Training Speed in Deep Learning Models Based on Memory Activity Survey
Dang Tuan Kiet, Binh Kieu-Do-Nguyen, Trong-Thuc Hoang, Khai-Duy Nguyen, Xuan-Tu Tran, Cong-Kha Pham
Advance online publication

Article ID: 18.20210252

Abstract

The Deep Learning (DL) training process involves intensive computations that require a large number of memory accesses. Many existing surveys of memory behavior during DL training rely on well-known profiling tools, or on improvements to those tools, to monitor the training process. This paper presents a new profiling approach based on a cooperative software and hardware solution. The idea is to use Field-Programmable Gate Array (FPGA) memory as the main memory for DL training on a computer, so that memory behavior can be monitored and evaluated from both the software and hardware points of view. The most common DL models are selected for the tests, including ResNet, VGG, AlexNet, and GoogLeNet, and the CIFAR-10 dataset is chosen as the training database. The experimental results show that the ratio of read to write transactions is roughly 3 to 1. Requested allocations vary from 2 B to 64 MB, with the most frequently requested sizes between approximately 16 KB and 64 KB. Based on these statistics, we suggest improving training speed by adding an L4 cache in front of the Double-Data-Rate (DDR) memory. The recommended L4 cache configuration is shown to improve DDR performance by about 15% to 18%.
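To illustrate the kind of analysis behind the L4 cache suggestion, the following is a minimal sketch of a set-associative LRU cache model replaying a memory-access trace and reporting its hit rate. The cache size, line size, associativity, and toy trace below are hypothetical examples, not the configuration or measurements from the paper; they only show how a hit rate could be estimated for a given read/write mix.

```python
from collections import OrderedDict

def simulate_cache(trace, cache_size=8 * 1024 * 1024, line_size=64, ways=16):
    """Replay a (op, address) trace through a set-associative LRU cache
    and return the hit rate. All parameters are illustrative only."""
    num_sets = cache_size // (line_size * ways)
    sets = [OrderedDict() for _ in range(num_sets)]  # per-set tag store in LRU order
    hits = 0
    for op, addr in trace:
        line = addr // line_size
        idx = line % num_sets
        tag = line // num_sets
        s = sets[idx]
        if tag in s:
            hits += 1
            s.move_to_end(tag)          # refresh LRU position on a hit
        else:
            if len(s) >= ways:
                s.popitem(last=False)   # evict the least recently used line
            s[tag] = True
    return hits / len(trace) if trace else 0.0

# Toy trace with roughly the 3:1 read-to-write mix reported in the abstract.
trace = [("R", 0x1000 + 64 * (i % 512)) if i % 4 else ("W", 0x200000 + 64 * i)
         for i in range(10_000)]
print(f"hit rate: {simulate_cache(trace):.2%}")
```

Sweeping the cache parameters over traces captured from the FPGA memory would give the data needed to pick a cost-effective L4 configuration.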

© 2021 by The Institute of Electronics, Information and Communication Engineers