In general, the effects of modeling errors, parameter variations, and external disturbances of the controlled object degrade tracking control performance. To address these problems, we have developed a control method based on deep reinforcement learning. Accordingly, a reference signal self-organizing control system based on a deep deterministic policy gradient (DDPG) is proposed, which is an extension of an existing control system using DDPG. In a previous study, we confirmed the realization of the swing-up and stabilizing motions of an inverted pendulum using the proposed control system(8). However, the addition of a new ability to the system could not be verified in that study. Thus, in this work, we aim to verify whether a new function can be added to the proposed control system. By performing a control simulation, we verified whether the proposed system can achieve robustness by using the inverted pendulum with an inertia rotor. A control simulation of the system was performed by adding noise into the system, and its control performance was investigated to confirm the robustness of the system. The simulation results indicate that the pendulum could not be inverted using an un-retrained control system. However, it was confirmed to have been inverted and stabilized by the swing-up and stabilizing control using a retrained control system. Moreover, the retrained control system could effectively function under the effect of noise with an accuracy close to that of a noise-free state. Therefore, we confirmed that the addition of robustness can be realized in the proposed control system.
抄録全体を表示