1A1-L06 人からの報酬と罰の逐次的な教示を利用するロボット学習モデル(進化・学習とロボティクス)

田中 爽太; 廣川 暢一; 鈴木 健嗣

doi:10.1299/jsmermd.2014._1A1-L06_1

抄録

Reinforcement Learning is a machine learning method to acquire a series of actions that maximizes a cumulative reward. However, it is difficult to optimize interaction between human and robot in a daily living space because there is no definite evaluation standard about undesirable actions. In this study, we propose a novel learning model using a successive reward and punishment based on human subjective evaluation. In this method, we developed human can restrain undesirable actions by giving punishment evaluation. We developed a dog-like robot to verify the proposed method and demonstrated its performance through the experiment.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

Mineralogical study of micro-inclusions in olivine in pallasite meteorite
Examination on Scaling Up of Service Business
Digital Twin of Intermediate Layer Temperatures Observation of IGBT Modules
[title in Japanese]
VO₂-SiO₂ Nano-Hybrid Particles by Nano-Coating on Monodispersed SiO₂ Particles

発行機関からのお知らせ

会員向け購読者番号とパスワードは以下URLよりご確認下さい。
https://www.jsme.or.jp/publication/proceedings/

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）