Journal of South China University of Technology(Natural Science Edition) ›› 2023, Vol. 51 ›› Issue (10): 99-109.doi: 10.12141/j.issn.1000-565X.230148

Special Issue: 2023绿色智慧交通系统专辑

• Green, Intelligent Traffic System • Previous Articles     Next Articles

Repositioning Strategy for Ride-Hailing Vehicles Based on Geometric Road Network Structure and Reinforcement Learning

XU Lunhui1 YU Jiaxin1 PEI Mingyang1 WU Pan2 LI Peng3   

  1. 1.School of Civil Engineering and Transportation,South China University of Technology,Guangzhou 510640,Guangdong,China
    2.School of Transportation Engineering,Chongqing Jiaotong University,Chongqing 400000,China
    3.School of Automobile and Transportation,Shenzhen Polytechnic,Shenzhen 518055,Guangdong,China
  • Received:2023-03-28 Online:2023-10-25 Published:2023-06-05
  • Contact: 裴明阳(1993-),女,博士,副教授,主要从事交通系统优化建模研究。 E-mail:mingyang@scut.edu.cn
  • About author:许伦辉(1965-),男,博士,教授,主要从事智能交通系统研究。E-mail:lhxu@scut.edu.cn
  • Supported by:
    the Basic and Applied Research Foundation of Guangdong Province(2020A1515111024)

Abstract:

The inefficient and inaccurate bidirectional search by both ride-hailing drivers and passengers leads to a mismatch between supply and demand. Ride-hailing vehicle repositioning strategy can pre-dispatch vehicles to areas with future demand, improving supply-demand matching. However, existing research mostly uses network grids to represent the urban road environment, lacking geometric topological information and reducing the dispatch accuracy. To address this issue, a ride-hailing vehicle relocation algorithm called GA2C was proposed based on Graph Neural Networks (GNN) and Actor-Critic reinforcement learning algorithm. This algorithm has a smoother learning process and can perform high-dimensional sampling, and it is suitable for learning the best relocation strategy for a large number of ride-hailing vehicles as multi-agent systems. Moreover, the geometric network structure was used to represent the urban road environment by using a GNN as a function approximator to learn the geometric information of the road network. Additionally, an action sampling strategy based on action value function was introduced to increase the randomness of action selection, effectively preventing competition. A ride-hailing vehicle relocation simulation experiment was conducted using Python, and the results are as follows: (i) the order response rate of the GA2C algorithm is 84.2%, significantly higher than all the comparative experimental results; (ii) in the order distribution comparative experiment, GA2C’s relative improvements in uniform distribution, central distribution layout, block distribution layout, and checkerboard distribution layout are 1.17%, 6.02%, 13.12%, and 14.55%, respectively. The above experimental results demonstrate that the GA2C algorithm can effectively relocate ride-hailing vehicles. When the order distribution presents significant differences, and the distance between different demand areas is relatively close, it can better learn dynamic demand changes, and achieve maximum order response rate by relocating ride-hailing vehicles.

Key words: urban traffic, repositioning strategy of ride-hailing, multi-agent reinforcement learning, graph neural network, Markov decision process

CLC Number: