Robot Navigation in Real World by Two Dimensional Evaluation Reinforcement Learning

Hiroyuki Okada; Hiroshi Yamakawa; Takashi Omori

doi:10.7210/jrsj.19.244

Hiroyuki Okada, Hiroshi Yamakawa, Takashi Omori

Author information

Keywords: Reinforcement Learning, Reward, Punishment, Mobile Robots, MEMORABLE

JOURNAL FREE ACCESS

2001 Volume 19 Issue 2 Pages 244-251

DOI https://doi.org/10.7210/jrsj.19.244

Details

Abstract

The trade-off of exploration and exploitation is present for a learnig method based on the trial and error such as reinforcement learning. We have proposed a reinforcement learning algorism using reward and punishment as repulsive evaluation (2D-RL) . In the algorithm, an appropriate balance between exploration and exploitation can be attained by using interest and utility. In this paper, we applied the 2D-RL to a navigation learning task of mobile robot, and the robot found a better path in real world by 2D-RL than by traditional actor-critic model.

Corresponding author

Register with J-STAGE for free!