1. Research on the manipulator intelligent trajectory planning method based on the improved TD3 algorithm
- Author
-
Qiang ZHANG, Wen WEN, Xiaodong ZHOU, Weihui LIU, and Xiaoyu CHU
- Subjects
obstacle avoidance planning ,target-guide ,trajectory demonstration ,double training ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
An intelligent trajectory planning and obstacle avoidance method based on the improved twin delayed deep deterministic policy gradient algorithm (TD3) was proposed to solve the trajectory planning problem for a 4-DOF manipulator mounted on a satellite.The training strategy had 2 periods.In the pre-training stage, the target position was always guided combining with the output of the strategy network to optimize the trajectory.After the pre-training, the algorithm can autonomously output the velocity trajectory while the initial position and the target were specified randomly in the joint space of the manipulator.This target-guided mechanism decreased the unnecessary explorations and improved the learning efficiency in high dimensional action space.In the second training stage, a collision-free safety reference trajectory was firstly obtained by demonstration, and then this trajectory was constantly learned during the training process until the final output trajectory has the ability to avoid obstacles.
- Published
- 2022
- Full Text
- View/download PDF