強化学習を用いた能動知覚方策の学習

畠中 渉; 佐々木 史紘; 山科 亮太

doi:10.1299/jsmermd.2020.2A1-L02

抄録

We propose a learning algorithm for an autonomous robot to acquire the observation skill that is advantageous for achieving its task. We consider the robot has movement and observation agents separately; the observation agent learns a policy for providing the observation for the movement agent, which learns how to achieve tasks better. Each policy is trained separately, and the observation policy is updated by using the differential value function before and after the movement policy is learned by the observation given by itself. Experiments on 2D navigation tasks in simulation show that our algorithm is more successful than conventional methods for the situation in which agent’s view is narrow.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

Locomotion Control of MEMS Micro Robot Using Pulse-Type Hardware Neural Networks
DAMAGE TO FOUNDATIONS OF RAILWAY STRUCTURES
尿道カテーテルによる医原性重度尿道下裂の治療経験
3種類の2×2分割表に関する研究
Hot-Atom Chemistry of Gas-and Liquid-Phase Systems

発行機関からのお知らせ

会員向け購読者番号とパスワードは以下URLよりご確認下さい。
https://www.jsme.or.jp/publication/proceedings/

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）