2013 Volume 79 Issue 801 Pages 1770-1774
This note presents an application of reinforcement learning to caterpillar robot locomotion. An excellent advantage of reinforcement learning is that an action can be acquired using only a simple reward. In our previous work, the reward was a forward distance measured using a sensor. This reward was completely an “Objective reward”. On the other hand, we have studied using the rewards given by the human's subjective judgment, which is defined as a “Subjective reward”. As a result, the “Subjective reward” gave better results than that of the “Objective reward” under specific conditions. The main purpose of this study is to investigate the teaching methods of human being, which is the main factor that gives better results than “Objective reward”. This note discusses the good teacher who gives an excellent “Subjective reward”. The results show that the human being gives reward in consideration of not only distance but also motion forms, and can give better results than that of the “Objective reward”.
Transactions of the Society of Mechanical Engineers
Transactions of the Japan Society of Mechanical Engineers
Transactions of the Japan Society of Mechanical Engineers Series A
Transactions of the Japan Society of Mechanical Engineers Series B
Transactions of the Japan Society of Mechanical Engineers Series C
TRANSACTIONS OF THE JAPAN SOCIETY OF MECHANICAL ENGINEERS Series A
TRANSACTIONS OF THE JAPAN SOCIETY OF MECHANICAL ENGINEERS Series B