Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
36th (2022)
Session ID : 3J3-OS-3a-04
Conference information

An Impact of Weight Initialization on Model Evaluations in Neural Architecture Search
*Nozomu YOSHINARIShinichi SHIRAKAWA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Architecture is one key factor determining neural networks' performance, and neural architecture search, which aims at finding competent architectures without human effort, is one of the most intensive research areas of automated machine learning. While most papers in the area focused only on architecture, recent research show performance of architecture depends on other hyperparameters such as learning rate, and simultaneous optimization of them is needed to obtain a better model. This research focuses on weight initialization methods and investigates their impact on the performance of architectures after training. Through experiments on the architectures defined in NAS-Bench-201, we found an initialization method considering architecture significantly improved the performance of many models.

Content from these authors
© 2022 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top