Journal of South China University of Technology(Natural Science Edition) ›› 2023, Vol. 51 ›› Issue (11): 44-55.doi: 10.12141/j.issn.1000-565X.220747

Special Issue: 2023年电子、通信与自动控制

• Electronics, Communication & Automation Technology • Previous Articles     Next Articles

Driverless Obstacle Avoidance and Tracking Control Based on Improved DDPG

LI Xinkai HU Xiaocheng MA Ping ZHANG Hongli   

  1. School of Electrical Engineering,Xinjiang University,Urumqi 830017,Xinjiang,China
  • Received:2022-11-14 Online:2023-11-25 Published:2023-03-28
  • About author:李新凯(1991-),男,博士,讲师,主要从事智能控制、复杂非线性控制研究。E-mail:lxk@xju. edu. cn
  • Supported by:
    the National Natural Science Foundation of China(62263030)

Abstract:

In the process of tracking and obstacle avoidance control of driverless vehicles, the controlled object has nonlinear characteristics and variable control parameters. The linear model and the fixed mathematical model of driverless vehicles are difficult to ensure the safety and stability of the vehicle in complex environments, and the driverless discrete control process increases the difficulty of control. To address such problems, in order to improve the accuracy of real-time control tracking trajectory of driverless vehicles, and at the same time reduce the difficulty of the whole control process, the paper proposed a Monte Carlo-depth deterministic policy gradient-based obstacle avoidance tracking control algorithm for driverless vehicles. The algorithm builds a control system model based on a deep reinforcement learning network, and adopts excellent training samples in the strategy learning sampling process. It optimizes the network training gradient with the Monte Carlo method, and makes a distinction between good and bad training samples for the algorithm. The excellent samples are used to find the optimal network parameters through a gradient algorithm, so as to enhance the learning ability of the network algorithm and realize a better and continuous control of the driverless vehicle. Simulation experiments of the control method were carried out in the computer simulation environment TORCS. The results show that the proposed improved DDPG algorithm can be applied to effectively achieve the obstacle avoidance tracking control of the driverless vehicle, and the tracking accuracy and obstacle avoidance effect of the unmanned car under its control is better than that of the deep Q network algorithm and the DDPG algorithm.

Key words: self-driving, dynamic obstacle avoidance, depth deterministic policy gradient, trajectory tracking, gradient optimization

CLC Number: