ロボティクス・メカトロニクス講演会講演概要集
Online ISSN : 2424-3124
セッションID: 2P1-J24
会議情報
2P1-J24 価値システムに基づく他者行為観察と自己行動学習の循環的発達(ロボカップ・ロボットコンテスト)
田村 佳宏高橋 泰岳浅田 稔
著者情報
会議録・要旨集 フリー

詳細
抄録
Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior learning but more seriously for life-time behavior learning. This paper presents a method for a robot to understand unfamiliar behaviors shown by others through the collaboration between behavior acquisition and recognition of observed behaviors, where the state value has an important role not simply for behavior acquisition (reinforcement learning) but also for behavior recognition (observation). That is, the state value updates can be accelerated by observation without real trials and errors while the learned values enrich the recognition system since it is based on estimation of the state value of the observed behavior. The validity of the proposed method is shown by applying it to a dynamic environment where two robots play soccer.
著者関連情報
© 2008 一般社団法人 日本機械学会
前の記事 次の記事
feedback
Top