1. Posture self-stabilizer of a biped robot based on training platform and reinforcement learning
- Author
-
Liyang Gao and Weiguo Wu
- Subjects
0209 industrial biotechnology ,Computer science ,General Mathematics ,Stability (learning theory) ,Evolutionary robotics ,02 engineering and technology ,Computer Science Applications ,020901 industrial engineering & automation ,Electronic stability control ,Control and Systems Engineering ,Control theory ,0202 electrical engineering, electronic engineering, information engineering ,State space ,Robot ,Reinforcement learning ,020201 artificial intelligence & image processing ,Software ,Simulation ,Abstraction (linguistics) - Abstract
In order to solve the problem of stability control for biped robots, the concept of stability training is proposed by using a training platform to exert random disturbance with amplitude limitation on robots that are to be trained. In this work, an approach to achieve a posture stabilizing capability based on stability training and reinforcement learning is explored and verified by simulations. An automatic abstraction method for state space is proposed by using the Gauss basis function and inner evaluation indexes to speed up the learning process. Hierarchical structure stabilizer using the Monte Carlo method is designed according to the concept of variable ZMP. Training samples are extracted from the state transition of the stability training process using balance controllers based on the robot dynamic model. The stabilizers are trained with and without applying the automatic abstraction of state space. Then simulation tests of them are conducted under conditions where the training platform exerts amplitude-limited random disturbances on the robot. Also, the influence of the model errors is studied by introducing deviations of the CoM position during the simulation tests. By comparing the simulation results of two learning stabilizers and the model-based balance controller, it is demonstrated that the designed stabilizer can achieve approximate success rate of the ideal model-based balance controller and exert all the driving ability of the robot under the large disturbance condition of 30 inclination of the platform. Also, the effects of the model error can be overcome by retraining using state transition data with the model error. The active training concept is proposed by applying a training platform.The training platform disturbs robots on it with amplitude-limited random motions.An automatic abstraction method is proposed for the high-dimensional state space.A learning posture stabilizer with hierarchical structure is designed.
- Published
- 2017
- Full Text
- View/download PDF