Proceedings of the Annual Conference of the Institute of Systems, Control and Information Engineers
The 47th Annual Conference of the Institute of Systems, Control and Information Engineers
Learning of the action-value function by the neural network, and application to waging-war game strategy acquisition
Tomoyuki Nakanishi, Ikuko Nishikawa

Pages 5010

Abstract
A neural network is used as a function approximator of the action-value function in reinforcement learning, in order to cope with a large number of discrete states. The network learns the λ-return using the backward view of Sarsa(λ), which enables on-line learning. The proposed method is applied to acquiring a heuristic strategy for the board game Dots-and-Boxes. In computer experiments, the network is trained through matches against a minimax player with search depth 1.
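
The backward view of Sarsa(λ) mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: for brevity it assumes a linear approximator over one-hot features in place of the neural network, and a hypothetical five-state chain task in place of Dots-and-Boxes. All names (`features`, `q_value`, `choose_action`, `step`, `train`) and all hyperparameter values are illustrative assumptions. The point it shows is that an eligibility trace over the weights accumulates the gradient of Q, so each TD error can be applied on-line at every step.

```python
import random

N_STATES, N_ACTIONS = 5, 2   # hypothetical toy chain, not Dots-and-Boxes
LEFT, RIGHT = 0, 1

def features(s, a):
    """One-hot encoding of (state, action). For a linear Q this vector
    is also the gradient of Q with respect to the weights."""
    x = [0.0] * (N_STATES * N_ACTIONS)
    x[s * N_ACTIONS + a] = 1.0
    return x

def q_value(w, s, a):
    """Approximate action value Q(s, a) as a dot product."""
    return sum(wi * xi for wi, xi in zip(w, features(s, a)))

def choose_action(w, s, eps, rng):
    """Epsilon-greedy selection with random tie-breaking."""
    if rng.random() < eps:
        return rng.randrange(N_ACTIONS)
    qs = [q_value(w, s, a) for a in range(N_ACTIONS)]
    best = max(qs)
    return rng.choice([a for a in range(N_ACTIONS) if qs[a] == best])

def step(s, a):
    """Move along the chain; reward 1 on reaching the right end."""
    s2 = min(s + 1, N_STATES - 1) if a == RIGHT else max(s - 1, 0)
    done = s2 == N_STATES - 1
    return s2, (1.0 if done else 0.0), done

def train(episodes=200, alpha=0.1, gamma=0.9, lam=0.8, eps=0.1,
          max_steps=10_000, seed=0):
    rng = random.Random(seed)
    w = [0.0] * (N_STATES * N_ACTIONS)
    for _episode in range(episodes):
        e = [0.0] * len(w)               # eligibility trace, reset per episode
        s = 0
        a = choose_action(w, s, eps, rng)
        for _t in range(max_steps):
            s2, r, done = step(s, a)
            a2 = None if done else choose_action(w, s2, eps, rng)
            target = r if done else r + gamma * q_value(w, s2, a2)
            delta = target - q_value(w, s, a)        # TD error
            grad = features(s, a)
            # Backward view: decay the trace, then add the current gradient.
            e = [gamma * lam * ei + gi for ei, gi in zip(e, grad)]
            # On-line update: every weight moves in proportion to its trace.
            w = [wi + alpha * delta * ei for wi, ei in zip(w, e)]
            if done:
                break
            s, a = s2, a2
    return w
```

Calling `w = train()` and then `q_value(w, 3, RIGHT)` gives the learned value of the final rewarding move, which should approach 1.0 in this toy task. With a neural network, `features(s, a)` in the trace update would be replaced by the gradient of the network output with respect to its weights, obtained by backpropagation.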
© 2003 The Institute of Systems, Control and Information Engineers