2023 Volume 89 Issue 10 Pages 783-789
This paper proposes a method of avoiding singularities and obstacles by using reinforcement learning to select the type of inverse kinematics solution of a manipulator. A Deep Q-Network (DQN) selects, from eight different solution types, the one that enables the manipulator to avoid singularities and obstacles throughout its motion path. The proposed method is applied to a 6-DOF collaborative robot. The DQN, a form of reinforcement learning, is constructed with the six joint angles as the observation and the eight solution types as the actions. The motion path of the manipulator is divided into steps of 0.1 s, and the solution type at each step is selected by the DQN. The agent is rewarded when the manipulator reaches the end of its motion path, penalized when it collides with the obstacle or with itself, and further penalized according to the six joint angular velocities. As a result, the DQN selects solution types that avoid singularities and obstacles. The proposed method thus makes it possible to select the solution types that realize the motion path of the robot hand without colliding with obstacles while minimizing the joint angular velocities.
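To make the setup above concrete, the following Python sketch outlines the kind of environment interface a DQN agent could be trained against: the observation is the six joint angles, the action is an index over the eight inverse kinematics solution types, and the reward combines a goal bonus, a collision penalty, and a joint-angular-velocity penalty. The class name, reward magnitudes, and the ik_solutions/in_collision stubs are assumptions made for illustration only; the paper does not publish its implementation.

```python
import numpy as np

# Assumed constants; the paper does not specify reward magnitudes.
REWARD_GOAL = 1.0          # reward for reaching the end of the motion path
PENALTY_COLLISION = -1.0   # penalty for colliding with the obstacle or itself
VELOCITY_WEIGHT = 0.01     # weight of the joint-angular-velocity penalty
STEP_TIME = 0.1            # the motion path is divided into 0.1 s steps


def ik_solutions(hand_pose):
    """Stub: stands in for an analytical 6-DOF IK routine that returns the
    eight joint-angle solution types (8 x 6 array) for the given hand pose."""
    return np.zeros((8, 6))


def in_collision(joint_angles):
    """Stub: stands in for a collision check of the arm against the obstacle
    and against its own links."""
    return False


class IKSelectionEnv:
    """Sketch of the decision process described in the abstract.

    Observation: the six joint angles of the manipulator (rad).
    Action: an index in 0..7 choosing one of the eight inverse kinematics
    solution types for the next 0.1 s segment of the hand path.
    """

    def __init__(self, hand_path):
        self.hand_path = hand_path          # sequence of target hand poses
        self.step_index = 0
        self.joint_angles = np.zeros(6)

    def reset(self):
        self.step_index = 0
        self.joint_angles = np.zeros(6)
        return self.joint_angles.copy()

    def step(self, action):
        target_pose = self.hand_path[self.step_index]
        new_angles = ik_solutions(target_pose)[action]

        # Penalize large joint angular velocities over the 0.1 s step.
        joint_velocities = (new_angles - self.joint_angles) / STEP_TIME
        reward = -VELOCITY_WEIGHT * np.sum(np.abs(joint_velocities))

        done = False
        if in_collision(new_angles):
            reward += PENALTY_COLLISION
            done = True
        elif self.step_index == len(self.hand_path) - 1:
            reward += REWARD_GOAL
            done = True

        self.joint_angles = new_angles
        self.step_index += 1
        return self.joint_angles.copy(), reward, done
```

A standard DQN whose network takes the six joint angles as input and outputs eight Q-values, one per solution type, would then be trained against this interface, e.g. env.step(action=3) applies solution type 3 for the next 0.1 s segment.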