2025 Volume 16 Issue 1 Pages 43-63
Developing hardware for artificial intelligence (AI) training is vital. A hardware-oriented optimizer named Holmes enables faster training with a smaller memory footprint. This study developed a hardware architecture that incorporates Holmes and exploits parallelization and pipelining to achieve a significant throughput improvement. We determined the bit width required for training and used it in the architecture evaluation. We also investigated the scalability of the architecture and the effectiveness of both Holmes and pipelining. The results demonstrated that the memory footprint scales linearly with model size, that Holmes reduces the memory footprint, and that pipelining drastically increases throughput, enabling faster computation.
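The abstract does not spell out Holmes's update rule, so the following is only a rough sketch of the general technique it alludes to: storing optimizer state at a reduced bit width so that optimizer memory shrinks with the chosen precision. The functions `quantize`, `dequantize`, and `momentum_step`, and all parameter values, are hypothetical illustrations, not the paper's actual algorithm.

```python
import numpy as np

def quantize(x, bits=8):
    """Uniform symmetric quantization of a float32 array to `bits` bits.

    Hypothetical helper: returns integer codes plus a per-tensor scale,
    so the stored state occupies `bits` bits per element instead of 32.
    """
    qmax = 2 ** (bits - 1) - 1
    max_abs = float(np.max(np.abs(x)))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    codes = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
    return codes, scale

def dequantize(codes, scale):
    """Recover an approximate float32 array from codes and scale."""
    return codes.astype(np.float32) * scale

def momentum_step(params, grads, codes, scale, lr=0.01, beta=0.9):
    """One momentum-style update with the state held in low-bit form
    between steps (illustrative values for lr and beta)."""
    m = beta * dequantize(codes, scale) + (1.0 - beta) * grads
    params -= lr * m
    return params, *quantize(m)  # re-quantize the state before storing it

# Usage: a single update step on random data.
rng = np.random.default_rng(0)
w = rng.normal(size=1024).astype(np.float32)
g = rng.normal(size=1024).astype(np.float32)
codes, scale = quantize(np.zeros_like(w))   # initial (zero) momentum state
w, codes, scale = momentum_step(w, g, codes, scale)
```

Under this scheme the per-element optimizer state drops from 32 bits to the quantized width, which is consistent with the abstract's claim that the memory footprint scales linearly with model size at the chosen bit width.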