連続空間での深層強化学習による羊の群れのハーディングに関する基礎検討

飯村 伊智郎; 鈴木 俊亮; 中山 茂

doi:10.1541/ieejeiss.142.149

Abstract

Strömbom et al. elucidated an algorithm in which a sheepdog can skillfully control a flock of sheep to guide them to a destination. This is called the Herding Algorithm, and it models the behavior of a sheepdog in two ways: “driving”, which guides a flock of sheep to a destination, and “collecting”, which brings the sheep together into one flock. In this model, Go et al. showed that an agent (sheepdog) could herd a flock of sheep with an inference model generated by reinforcement learning (RL). However, in their previous study, RL learned only the movement behavior to the positions at which the agent performs “driving” and “collecting” in the discretized environmental state and behavioral space. In this study, we have assumed a continuous environmental state and behavioral space. We have confirmed that even if the agent's herding behavior is the learning target, the proposed inference model generated by deep RL can herd sheep.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!