A learning strategy using simulator for real hardware of swing-up pendulum

Shingo Nakamura; Ryo Saegusa; Shuji Hashimoto

doi:10.14864/softscis.2006.0.971.0

SCIS & ISIS 2006

セッションID: FR-I2-2

DOI https://doi.org/10.14864/softscis.2006.0.971.0

会議情報

主催: Japan SOciety for Fuzzy Theory and intelligent informatics

共催: The Korea Fuzzy Logic and Intelligent Systems Society, IEEE Computational Intelligence Society, The International Fuzzy Systems Association, 21th Century COE Program "Creation of Agent-Based Social Systems Sciences"

FR-I2 Adaptive behavior in autonomous robots

A learning strategy using simulator for real hardware of swing-up pendulum

*Shingo Nakamura, Ryo Saegusa, Shuji Hashimoto

著者情報

キーワード: Swing-up pendulum, Simulator, Machine learning

会議録・要旨集フリー

詳細

抄録

We proposed a novel method of hybrid machine learning using both simulator and real hardware. In advance, a simulator of the hardware is built with the actually acquired data from the real hardware using neural networks and the back-propagation learning method. Afterward, the objective controller of the hardware is trained only with the built simulator by the reinforcement learning method. Finally, the controller is applied to the real hardware. The both learning processes for the simulator and the controller are performed without using the real hardware after the data sampling, therefore load against the hardware is less than using the real hardware, and the objective controller can be optimized faster than real time learning. As an example, we picked up the pendulum swing-up task which was a typical nonlinear control problem, and the proposed method worked successfully.

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）