2008, Vol. 74, No. 747, pp. 2747-2754
This paper proposes an extension of a reinforcement learning method called BRL to improve its robustness. A novel feature of BRL is that the continuous state space and the continuous action space are segmented autonomously and simultaneously during online learning. We have shown elsewhere that BRL is effective not only for single-robot problems but also for multi-robot problems. In BRL, the continuous state space is segmented by the Bayesian discrimination function method, based on the instances perceived in each episode, while the continuous action space is segmented by the same method based on randomly generated actions. Random action generation seems reasonable when a perceived state is clearly different from the states in the acquired rules, but it seems inappropriate when a perceived state is somewhat similar to those states. For the latter case, we therefore propose an extension of BRL in which the action is computed as the weighted linear interpolation of the actions in the similar rules. After formalizing the proposed extension, we demonstrate the navigation problem of an autonomous mobile robot, in computer simulation as well as in physical experiments, to verify the improvement achieved by the proposed method.
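The interpolation idea described above can be sketched as follows. This is a minimal illustration, not the paper's exact formulation: the rule representation (a state center paired with an action), the Gaussian similarity kernel, and the similarity threshold are all assumptions made for the example.

```python
import numpy as np

def interpolate_action(state, rules, sim_threshold=0.5):
    """Return an action for `state` as a weighted linear interpolation
    of the actions of rules whose states are similar to `state`.

    `rules` is a list of (state_center, action) pairs. The Gaussian
    similarity measure and the threshold are illustrative assumptions.
    """
    sims, actions = [], []
    for center, action in rules:
        # Similarity in state space (assumed Gaussian kernel, width 1.0)
        sim = np.exp(-np.sum((np.asarray(state) - np.asarray(center)) ** 2))
        if sim >= sim_threshold:
            sims.append(sim)
            actions.append(np.asarray(action, dtype=float))
    if not sims:
        # No similar rule: BRL would instead fall back to a randomly
        # generated action in this case.
        return None
    w = np.array(sims) / np.sum(sims)  # normalized interpolation weights
    return np.sum(w[:, None] * np.array(actions), axis=0)

# Example: a state midway between two rule states yields the average action.
rules = [((0.0, 0.0), (1.0, 0.0)), ((0.1, 0.0), (0.0, 1.0))]
a = interpolate_action((0.05, 0.0), rules)
```

In this example the perceived state is equidistant from both rule states, so both rules receive weight 0.5 and the interpolated action is the midpoint of the two stored actions.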