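1A1-K05 Development of a Convergence Estimation Method for Reinforcement Learning Using Fractal Dimension Analysis (Evolution, Learning, and Robotics)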

村田 雄太; 河野 仁; 神村 明哉; 富田 康治; 鈴木 剛

doi:10.1299/jsmermd.2014._1A1-K05_1

Abstract

This paper describes a convergence estimation method and a learning termination method for reinforcement learning in dynamic environments. In recent years, multi-robot systems that use reinforcement learning have been developed for real-world situations. However, conventional learning methods take a considerable amount of time to converge. Furthermore, conventional learning processes are often inefficient because the robot continues reinforcement learning even after learning has converged. In response to this problem, we propose a Knowledge Co-creation Framework (KCF) for multi-robot systems, whose efficient implementation requires autonomous convergence estimation and learning termination methods for reinforcement learning. Therefore, on the basis of the assumption that learning curves exhibit fractality, we propose a convergence estimation method and a learning termination method that use fractal dimension analysis. Furthermore, we confirmed through computer simulation that the proposed method detects learning convergence and terminates reinforcement learning.
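
The abstract does not say which fractal dimension estimator or which stopping criterion the authors use. As a rough illustration of what a fractal dimension analysis of a learning curve could look like, the sketch below uses Higuchi's method, which is an assumption made here and not necessarily the paper's choice. The names higuchi_fd, windowed_fd, k_max, window, and step are hypothetical, and the demo curve is synthetic. No termination threshold is shown because the abstract does not state one.

```python
import math
import random


def higuchi_fd(series, k_max=8):
    """Estimate the fractal dimension of a 1-D series with Higuchi's method.

    Assumption: Higuchi's estimator stands in for the paper's (unstated) analysis.
    """
    n = len(series)
    if k_max < 2 or n < 2 * k_max:
        raise ValueError("need k_max >= 2 and len(series) >= 2 * k_max")
    if max(series) == min(series):
        return 1.0  # assumption: a perfectly flat curve is treated as a smooth line

    log_inv_k, log_len = [], []
    for k in range(1, k_max + 1):
        lengths = []
        for m in range(k):
            steps = (n - 1 - m) // k  # number of increments starting at offset m
            dist = sum(abs(series[m + i * k] - series[m + (i - 1) * k])
                       for i in range(1, steps + 1))
            # Normalise so curve lengths measured at different k are comparable.
            lengths.append(dist * (n - 1) / (steps * k) / k)
        mean_len = sum(lengths) / k
        if mean_len > 0.0:  # skip scales where the log would be undefined
            log_inv_k.append(math.log(1.0 / k))
            log_len.append(math.log(mean_len))
    if len(log_inv_k) < 2:
        raise ValueError("not enough usable scales to fit a slope")

    # The least-squares slope of log L(k) against log(1/k) is the estimate.
    mx = sum(log_inv_k) / len(log_inv_k)
    my = sum(log_len) / len(log_len)
    num = sum((x - mx) * (y - my) for x, y in zip(log_inv_k, log_len))
    den = sum((x - mx) ** 2 for x in log_inv_k)
    return num / den


def windowed_fd(curve, window=100, step=100, k_max=8):
    """Estimate the fractal dimension on successive windows of a learning curve."""
    return [higuchi_fd(curve[s:s + window], k_max)
            for s in range(0, len(curve) - window + 1, step)]


if __name__ == "__main__":
    random.seed(0)
    # Synthetic stand-in for a learning curve: noisy rise, then a noisy plateau.
    curve = [min(t, 300) / 300 + 0.05 * random.random() for t in range(600)]
    for start, fd in zip(range(0, 600, 100), windowed_fd(curve, 100, 100)):
        print(f"episodes {start}-{start + 99}: FD ~ {fd:.2f}")
```

Tracking the windowed estimate over episodes gives one signal that a termination rule could monitor. How that signal maps to "converged" is left open here, since it depends on the criterion defined in the full paper.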
