IEICE Communications Express
Online ISSN : 2187-0136
ISSN-L : 2187-0136

This article has now been updated. Please use the final version.

Applying Double DQN to Reinforcement learning of Automated Designing ICT System
Natsuki OkamuraYutaka YakuwaTakayuki KurodaIkuko E. Yairi
Author information
JOURNAL FREE ACCESS Advance online publication

Article ID: 2022XBL0100

Details
Abstract

Designing an ICT system providing certain network application services (DICTS), which consists of selecting appropriate equipment for various requirements, optimal arrangements, and correct connections, needs specialized knowledge and enormous human power. Autonomous DICTS technology using deep reinforcement learning (DRL) has an elementary problem of huge learning time caused by accidentally overestimating a specific configuration because of the sparse reward despite a vast combination of selections, arrangements, and connections. This paper applies our improved Double DQN, a typical DRL algorithm to suppress overestimation, to an autonomous DICTS technology named Weaver and demonstrates the possibility of reducing the number of episodes until convergence by 25%.

Content from these authors
© 2022 The Institute of Electronics, Information and Communication Engineers
feedback
Top