Green & Intelligent Transportation

Taxi Trajectory Characteristics Analysis Based on Frequent Sequence Mining

  • LONG Xueqin ,
  • WANG Han ,
  • WANG Ruixuan
Expand
  • College of Transportation Engineering,Chang’an University,Xi’an 710064,Shaanxi,China

Received date: 2023-06-01

  Online published: 2023-11-08

Supported by

the Natural Science Basic Research Program of Shaanxi Province(2024JC-YBMS-338);the Provincial Key Research and Development Program of Shaanxi(2023-YBGY-138)

Abstract

In order to further clarify the differences in routing behaviors of different taxis, this paper adopted the method of frequent sequence mining to extract the frequent path between the same OD pairs, construct path sets, and analyze the similar characteristics of path sets from static and dynamic perspectives. By taking the trajectory data of taxis in Xi’an City as the research object, the path set between OD pairs is obtained through grid division and road network matching. Then, the frequent path is redefined, the PrefixSpan evolution algorithm is adopted, and the dynamic threshold and frequency index based on the obtained frequent subsequences are introduced to mine frequent paths. Furthermore, in order to complete the construction of three kinds of effective path sets, the shortest path and other paths are extracted, and the general properties of the constructed path sets are analyzed. Finally, the similarity between two-dimension time series (tracks) on the path is represented as dynamic similarity, and the similarity between one-dimension directed sequences (sections) is represented as static similarity, and the similarity analysis of three types of paths is carried out based on the improved longest common subsequence and dynamic time regularity algorithm. The results show that: (1) the similarity between the frequent path and the shortest path is rather high, meaning that most taxis still choose the road with the lowest travel time but not the shortest path; (2) time and distance are still the main considerations for travelers when choosing a path, but travelers do not completely pursue the shortest time or distance; (3) the calculated dynamic similarity is significantly higher than the static similarity, which means that the two-dimension sequential similarity on the path is higher than the one-dimension shape similarity; and (4) the two proposed methods both possess the highest similarity between the frequent path and the shortest path and the lowest similarity between the shortest path and other paths The consistency of the comparison results indicates that the similarity of the static path can be roughly measured by the that of the dynamic trajectory. The proposed frequent path mining algorithm is of certain reliability. It can provide supports for urban traffic managers with recommend routes and planed roads.

Cite this article

LONG Xueqin , WANG Han , WANG Ruixuan . Taxi Trajectory Characteristics Analysis Based on Frequent Sequence Mining[J]. Journal of South China University of Technology(Natural Science), 2024 , 52(6) : 24 -33 . DOI: 10.12141/j.issn.1000-565X.230375

References

1 CHEN G, CHEN B, YU Y .Mining frequent trajectory patterns from GPS tracks[C]∥Proceedings of 2010 International Conference on the Computational Intelligence and Software Engineering.Wuhan:IEEE,2010:1-6.
2 LYU J, SUN Q, LI Q,et al .Multi-scale and multi-scope convolutional neural networks for destination prediction of trajectories[J].IEEE Transactions on Intelligent Transportation Systems202021(8):3184-3195.
3 BIN C, GU T, SUN Y,et al .A personalized POI route recommendation system based on heterogeneous tourism data and sequential pattern mining[J].Multimedia Tools and Applications201978:35135-35156.
4 QIAN S, CHENG B, CAO J,et al .Detecting taxi trajectory anomaly based on spatio-temporal relations[J].IEEE Transactions on Intelligent Transportation Systems202123(7):11-12.
5 ZHENG L J, XIA D, ZHAO X,et al .Spatial-temporal travel pattern mining using massive taxi trajectory data[J].Physica A:Statistical Mechanics and Iits Applications2018501:24-41.
6 杨振娟 .基于出租车GPS轨迹的热点路後挖掘和载客路径推荐[D].兰州:西北师范大学,2020
7 吴俊伟,朱云龙,库涛,等 .基于网格聚类的热点路径探测[J].吉林大学学报(工学版)201545(1):274-282.
  WU Jun-wei, ZHU Yun-long, KU Tao,et al .Hot routes detection algorithm based on grid clustering[J].Journal of Jilin University (Engineering and Technology Edition)201545(1):274-282.
8 段宗涛,任国亮,康军,等 .基于频繁轨迹序列模式挖掘的路径推荐方法[J].太原理工大学学报202253(2):240-247.
  DUAN Zong-tao, REN Guo-liang, KANG Jun,et al .Route recommendation method based on frequent trajectory sequence pattern mining[J].Journal of Taiyuan University of Technology202253(2):240-247
9 袁淑君 .基于频繁序列挖掘与双层Logit模型的出租车寻客行为建模方法[D].武汉:武汉大学,2019
10 孙文平,常亮,宾辰忠,等 .基于知识图谱和频繁序列挖掘的旅游路线推荐[J].计算机科学201946(2):56-61.
  SUN Wen-ping, CHANG Liang, Chen-zhong BIN,et al .Travel route recommendation based on knowledge graph and frequent sequence mining[J].Computer Science201946(2):56-61.
11 胡冰冰,芦俊丽,郑承宇,等 .改进的 PrefixSpan 算法在旅游热门路线上的应用[J].云南民族大学学报(自然科学版)202231(1):94-102.
  HU Bing-bing, LU Jun-li, ZHENG Cheng-yu,et al .Application of improved PrefixSpan agorithm on popular travel routes[J].Journal of Yunnan Nationalities University (Natural Sciences Edition)202231(1):94-102.
12 林鹏飞,翁剑成,胡松,等 .公共交通乘客个体活动链的日相似性研究[J].交通运输系统工程与信息202020(6):178-183.
  LIN Peng-fei, WENG Jian-cheng, HU Song,et al .Day-to-day similarity of individual activity chain of public transport passengers[J].Journal of Transportation Systems Engineering and Information Technology202020(6):178-183.
13 吴刚,邱煜晶,王国仁 .基于隐马尔可夫模型和遗传算法的地图匹配算法[J].东北大学学报(自然科学版)201738(4):472-475.
  WU Gang, QIU Yu-jing, WANG Guo-ren .Map matching algorithm based on hidden Markov model and genetic algorithm[J].Journal of Northeastern University (Natural Science)201738(4):472-475.
14 SRIKANT R, AGRAWAL R .Mining sequential patterns:generalization sandper form anceim provements[C]∥Proceedings of the 5th International Conferenceon Extending Database Technology:Advances in Database Technology.Avignon:[s. n.],1996:3-17.
15 VLACHOS M, KOLLIOS G, GUNOPULOS D .Discovering similar multidimensional trajectories[C]∥Proceedings of the 18th International Conference on Data Engineering.Washington DC:IEEE Computer Society,2002:673-684.
16 SANKOFF D, KRUSKAL J .Time warps,string edits,and macromolecules:the theory and practice of sequence comparison[M].Boston:Addison-Wesley,1983
Outlines

/