Reinforcement learning to achieve real-time control of triple inverted pendulum

Baek, Jongchan; Lee, Changhyeon; Lee, Young Sam; Jeon, Soo; Han, Soohee

doi:10.1016/j.engappai.2023.107518

상세 보기

Reinforcement learning to achieve real-time control of triple inverted pendulum

Baek, Jongchan;
Lee, Changhyeon;
Lee, Young Sam;
Jeon, Soo;
Han, Soohee

Citations

WEB OF SCIENCE

18

Citations

SCOPUS

30

초록

This work uses reinforcement learning (RL) to achieve the first-ever data-driven real-time control of an actual, not simulated, triple inverted pendulum (TIP) in a model-free way. A swing-up control task for the TIP is formulated as a Markov decision process with a dense reward function, then conducted in real time by using a model-free RL approach. To increase the sample efficiency of learning, a structure-aware virtual experience replay (VER) method is proposed; it works together with an off-policy actor-critic algorithm. The VER exploits the geometrically-symmetric property of TIPs to create virtual sample trajectories from measured ones, then uses the resulting multifold augmented dataset to effectively train actor and critic networks during the learning process. These structure-infused training data serve to obtain additional information and hence increase the convergence speed of network learning. We combine the proposed VER with a state-of-the-art actor-critic algorithm, and then validate its effectiveness through numerical simulations. Notably, the inclusion of VER amplifies computational efficiency, slashing the requisite trials, training steps, and overall duration by approximately 66.67%. Finally experiments demonstrate the real-time control capability of the proposed approach on an actual TIP system.

키워드

Triple pendulum on a cart; Swing-up control; Reinforcement learning; Virtual experience replay; SWING-UP; CART

제목: Reinforcement learning to achieve real-time control of triple inverted pendulum

저자: Baek, Jongchan; Lee, Changhyeon; Lee, Young Sam; Jeon, Soo; Han, Soohee

DOI: 10.1016/j.engappai.2023.107518

발행일: 2024-02

유형: Article

저널명: Engineering Applications of Artificial Intelligence

권: 128