Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 3P5-OS-17a-03

Systematic design of artificial deep neural networks based on scaling laws in signal propagation
*Keiichi TAMAI, Tsuyoshi OKUBO, Truong Vinh Truong DUY, Naotake NATORI, Synge TODO
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

For more environmentally sustainable development of deep learning (DL) technologies, the computational burden of tuning DL architectures should be reduced. This calls for more systematic strategies for finding an optimal set of hyperparameters that achieves a good balance between training speed and generalization performance. As a preliminary step toward this goal, we address the problem of how to systematically tune fully-connected feedforward perceptrons in the so-called "kernel regime". By combining existing theoretical tools, such as the Neural Tangent Kernel (NTK), with the analogy between signal propagation dynamics and absorbing phase transitions, we conduct a thorough analysis of the training dynamics of the neural network, including the finite-depth case. As a result, we propose a simple strategy for optimally tuning the initialization hyperparameters and the depth.
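
To illustrate what signal propagation at initialization means, the following is a minimal sketch and not the authors' procedure. It assumes a ReLU activation (the abstract does not name one), Gaussian weights with variance `sigma_w2 / width`, and Gaussian biases with variance `sigma_b2`; all function and parameter names are hypothetical. Under these assumptions the standard mean-field recursion for the mean squared pre-activation is q_{l+1} = (sigma_w2 / 2) q_l + sigma_b2, so with zero bias the signal decays to zero for sigma_w2 < 2 (loosely, the absorbing state in the phase-transition analogy), grows without bound for sigma_w2 > 2, and stays of order one at sigma_w2 = 2.

```python
import numpy as np


def propagate_variance(sigma_w2, sigma_b2=0.0, depth=20, width=512, seed=0):
    """Mean squared pre-activation per layer of one random ReLU MLP (assumed setup)."""
    rng = np.random.default_rng(seed)
    h = rng.standard_normal(width)            # layer-0 pre-activations, q_0 close to 1
    q = [float(np.mean(h ** 2))]
    for _ in range(depth):
        # Weights ~ N(0, sigma_w2 / width), biases ~ N(0, sigma_b2)
        W = rng.standard_normal((width, width)) * np.sqrt(sigma_w2 / width)
        b = rng.standard_normal(width) * np.sqrt(sigma_b2)
        h = W @ np.maximum(h, 0.0) + b        # ReLU, then affine map
        q.append(float(np.mean(h ** 2)))
    return np.array(q)


def mean_field_variance(sigma_w2, sigma_b2=0.0, depth=20, q0=1.0):
    """Infinite-width prediction for ReLU: q_{l+1} = sigma_w2 * q_l / 2 + sigma_b2."""
    q = [q0]
    for _ in range(depth):
        q.append(0.5 * sigma_w2 * q[-1] + sigma_b2)
    return np.array(q)


if __name__ == "__main__":
    for sigma_w2 in (1.0, 2.0, 4.0):
        empirical = propagate_variance(sigma_w2)[-1]
        predicted = mean_field_variance(sigma_w2)[-1]
        print(f"sigma_w2={sigma_w2}: empirical q_L={empirical:.3e}, mean-field q_L={predicted:.3e}")
```

At finite width the empirical values fluctuate around the mean-field prediction, which is one reason the finite-depth and finite-size behavior near the critical point matters when choosing the initialization and the depth.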

© 2024 The Japanese Society for Artificial Intelligence