Reinforcement Learning of a Supervisor Using a Language Measure

谷口 和隆; 潮 俊光; 山崎 達志

doi:10.11509/sci.SCI04.0.228.0

Abstract

This paper proposes a method for synthesizing a supervisor that is optimal with respect to a language measure by using reinforcement learning. Recently, the concept of a language measure was introduced for formal languages, and a synthesis method for an optimal supervisor based on the language measure has been proposed. In this paper, we apply reinforcement learning to learn the language measure and show that the supervisor that is optimal with respect to the language measure can be derived through learning. We examine the optimality of the obtained supervisor by computer simulation.
