Host: The Japanese Society for Artificial Intelligence
Name: The 39th Annual Conference of the Japanese Society for Artificial Intelligence
Number: 39
Location: Osaka, Japan
Date: May 27, 2025 - May 30, 2025
Offline reinforcement learning (RL) enables policy learning from pre-collected datasets without environmental interaction, reducing the cost of data collection and mitigating safety risks in robotic control. However, real-world deployment requires robustness to control failures, which remains challenging because offline training involves no exploration. To address this issue, we propose an offline-to-online RL method that improves robustness with minimal online fine-tuning. During fine-tuning, perturbations that simulate control component failures, both random and adversarial, are applied to the joint torque signals. We conduct experiments using legged robot models in OpenAI Gym. The results demonstrate that offline RL alone does not improve robustness and remains highly vulnerable to perturbations, whereas our method significantly improves robustness against them.
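
The abstract does not spell out how the torque perturbations are applied, so the following is only a minimal Python sketch of the general idea: an OpenAI Gym action wrapper that corrupts joint torque commands during the online fine-tuning phase. The class name TorquePerturbationWrapper, the magnitude parameter, and the grad_fn callback are illustrative assumptions rather than the paper's implementation, and the adversarial branch substitutes a simple FGSM-style sign step for whatever adversarial scheme the authors actually use.

import numpy as np
import gym


class TorquePerturbationWrapper(gym.ActionWrapper):
    """Perturbs joint torque (action) signals to simulate control
    component failures during online fine-tuning.

    mode="random":      adds zero-mean uniform noise scaled by `magnitude`.
    mode="adversarial": takes an FGSM-style step against a caller-supplied
                        return-gradient estimate (hypothetical `grad_fn`).
    """

    def __init__(self, env, mode="random", magnitude=0.1, grad_fn=None):
        super().__init__(env)
        self.mode = mode            # perturbation type: "random" or "adversarial"
        self.magnitude = magnitude  # perturbation size, as a fraction of the torque range
        self.grad_fn = grad_fn      # assumed callable: action -> dJ/da estimate

    def action(self, action):
        low, high = self.action_space.low, self.action_space.high
        scale = self.magnitude * (high - low)
        if self.mode == "random":
            # Random failure: uniform noise on every joint torque.
            delta = np.random.uniform(-scale, scale, size=action.shape)
        elif self.mode == "adversarial":
            # Adversarial failure: move torques in the direction that most
            # decreases the estimated return (sign of the supplied gradient).
            delta = -scale * np.sign(self.grad_fn(action))
        else:
            delta = np.zeros_like(action)
        # Keep the perturbed torques within the actuator limits.
        return np.clip(action + delta, low, high)


# Usage sketch: wrap a legged-robot task for the online fine-tuning phase.
env = TorquePerturbationWrapper(gym.make("Hopper-v3"), mode="random", magnitude=0.2)

Wrapping the environment rather than the policy keeps the perturbation logic independent of the RL algorithm, so the same offline-trained agent can be fine-tuned against either perturbation type by swapping the mode argument.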