Influence of Rewards Given in Deep Reinforcement Learning on the Traveling Trajectories of Autonomous Mobile Robots

古谷 琢海; 加藤 勇気; 森岡 一幸

doi:10.1299/jsmermd.2019.2P2-A02

Session ID: 2P2-A02

DOI https://doi.org/10.1299/jsmermd.2019.2P2-A02

Conference information

Organizer: The Japan Society of Mechanical Engineers

Conference: Robotics and Mechatronics Conference 2019

Dates: 2019/06/05 - 2019/06/08

*古谷琢海, 加藤勇気, 森岡一幸

Keywords: Robot Navigation, Deep Q-Network, Reward

Abstract

We developed an autonomous mobile robot system whose behaviors are acquired through deep reinforcement learning. Navigation performance, including the trajectories the robot travels, is affected by how the state, action, and reward are designed. This paper focuses on the rewards given while training action policies in a simulator. For example, when a negative reward is given whenever the robot comes close to an obstacle, the robot tends to travel farther from obstacles. Navigation experiments with a robot in the real world were performed, and the differences among the trajectories produced by several reward designs are discussed.
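The proximity penalty described in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' reward function: the abstract gives no numerical values, so the function name `step_reward`, the distance threshold, and all reward magnitudes below are hypothetical.

```python
def step_reward(min_obstacle_distance, reached_goal, collided,
                near_threshold=0.5, near_penalty=-0.1,
                goal_reward=1.0, collision_penalty=-1.0):
    """Return a per-step reward for a navigation agent.

    All thresholds and magnitudes are hypothetical placeholders;
    the source abstract does not state the values used.
    """
    if collided:
        # Terminal failure: the robot hit an obstacle.
        return collision_penalty
    if reached_goal:
        # Terminal success: the robot arrived at the goal.
        return goal_reward
    if min_obstacle_distance < near_threshold:
        # Negative reward for coming close to an obstacle, which should
        # push the learned policy toward trajectories with more clearance.
        return near_penalty
    # No shaping signal when the robot is far from obstacles.
    return 0.0


# Two reward designs for the same situation (0.3 m from an obstacle):
with_penalty = step_reward(0.3, reached_goal=False, collided=False)
without_penalty = step_reward(0.3, reached_goal=False, collided=False,
                              near_penalty=0.0)
```

Training one policy with `near_penalty` set to a negative value and another with it set to zero is one way to produce the kind of trajectory comparison the abstract mentions, assuming everything else in the training setup is held fixed.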

Corresponding author
