既学習モジュールの切替による未知環境探索エージェントの行動制御

畠 崇人; 菅沼 雅徳; 長尾 智晴

doi:10.1541/ieejeiss.138.157

抄録

In this paper, we try to acquire various behavior patterns of autonomous exploration agent using several learning environments. In case of previous learning methods using a single behavior rule set, it is hard to acquire the behavior that covers all learning environments. In our method, we divide learning environments into some primitive environments whose properties differ each other, and then generate modules that are specialized for each primitive environment. To optimize behavior rules of agents, we adopt Graph Structured Program Evolution (GRAPE) which can automatically generates graph structured programs. In unknown environments, each module is switched by a program named “switcher”. The switcher selects the module that acts better in a neighboring environment. Through several experiments, our method achieved higher exploration rate in unknown environments compared to simple GRAPE, random search, and the method that switches modules randomly.

著者関連情報

お気に入り & アラート

閲覧履歴

発行機関からのお知らせ

【電気学会会員の方】購読している論文誌を無料でご覧いただけます（会員ご本人のみの個人としての利用に限ります）。購読者番号欄にMyページへのログインIDを，パスワード欄に生年月日8ケタ（西暦，半角数字。例：19800303）を入力して下さい。

ダウンロード

論文(PDF)の閲覧方法はこちら
閲覧方法 (389.7K)

前身誌

電気学会論文誌. C

電氣學會雜誌

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）