收稿日期: 2023-06-01
网络出版日期: 2023-11-08
基金资助
陕西省自然科学基础研究计划项目(2024JC-YBMS-338);陕西省重点研发计划项目(2023-YBGY-138)
Taxi Trajectory Characteristics Analysis Based on Frequent Sequence Mining
Received date: 2023-06-01
Online published: 2023-11-08
Supported by
the Natural Science Basic Research Program of Shaanxi Province(2024JC-YBMS-338);the Provincial Key Research and Development Program of Shaanxi(2023-YBGY-138)
为进一步厘清不同出租车路径选择行为的差异性,采用频繁序列挖掘方法提取了同一个OD对间的频繁路径,构建路径选择集,分别从静态和动态两个角度分析路径集的相似特性。以西安市出租车的轨迹数据为研究对象,通过栅格划分与路网匹配,获得了不同OD对之间的路径集合。重新定义了频繁路径,采用PrefixSpan演变算法,在得到频繁子序列的基础上引入动态阈值和频繁度指标挖掘频繁路径,提取了最短路径和其他路径,完成了3类有效路径集的构建,并分析了路径集的一般属性。其后,将路径上二维时间序列(轨迹)间的相似度表示为动态相似度,将一维有向序列(路段)间的相似度表示为静态相似度,基于改进的最长公共子序列和动态时间规整算法对3类路径进行了相似性分析。结果表明:频繁路径与最短路径的相似度较高,意味着大多数出租车仍然选择具有最低出行时间的路段,但不一定会选择最短路径;时间和距离仍是出行者选择路径时主要考虑的因素,但出行者并不完全追求时间最短或距离最短;试验得到的动态相似度计算结果显著高于静态相似度计算结果,说明路径上的二维时序相似度高于一维形状相似度;两种方法下频繁路径和最短路径的相似度均最高,最短路径和其他路径的相似度均最低,比较结果的一致性说明可以用动态轨迹的相似度来大致度量静态路径的相似度。文中的频繁路径挖掘算法具有一定的可靠性,可为城市交通管理者进行路径推荐、道路规划等提供支持。
龙雪琴 , 王晗 , 王瑞璇 . 基于频繁序列挖掘的出租车轨迹特性分析[J]. 华南理工大学学报(自然科学版), 2024 , 52(6) : 24 -33 . DOI: 10.12141/j.issn.1000-565X.230375
In order to further clarify the differences in routing behaviors of different taxis, this paper adopted the method of frequent sequence mining to extract the frequent path between the same OD pairs, construct path sets, and analyze the similar characteristics of path sets from static and dynamic perspectives. By taking the trajectory data of taxis in Xi’an City as the research object, the path set between OD pairs is obtained through grid division and road network matching. Then, the frequent path is redefined, the PrefixSpan evolution algorithm is adopted, and the dynamic threshold and frequency index based on the obtained frequent subsequences are introduced to mine frequent paths. Furthermore, in order to complete the construction of three kinds of effective path sets, the shortest path and other paths are extracted, and the general properties of the constructed path sets are analyzed. Finally, the similarity between two-dimension time series (tracks) on the path is represented as dynamic similarity, and the similarity between one-dimension directed sequences (sections) is represented as static similarity, and the similarity analysis of three types of paths is carried out based on the improved longest common subsequence and dynamic time regularity algorithm. The results show that: (1) the similarity between the frequent path and the shortest path is rather high, meaning that most taxis still choose the road with the lowest travel time but not the shortest path; (2) time and distance are still the main considerations for travelers when choosing a path, but travelers do not completely pursue the shortest time or distance; (3) the calculated dynamic similarity is significantly higher than the static similarity, which means that the two-dimension sequential similarity on the path is higher than the one-dimension shape similarity; and (4) the two proposed methods both possess the highest similarity between the frequent path and the shortest path and the lowest similarity between the shortest path and other paths The consistency of the comparison results indicates that the similarity of the static path can be roughly measured by the that of the dynamic trajectory. The proposed frequent path mining algorithm is of certain reliability. It can provide supports for urban traffic managers with recommend routes and planed roads.
| 1 | CHEN G, CHEN B, YU Y .Mining frequent trajectory patterns from GPS tracks[C]∥Proceedings of 2010 International Conference on the Computational Intelligence and Software Engineering.Wuhan:IEEE,2010:1-6. |
| 2 | LYU J, SUN Q, LI Q,et al .Multi-scale and multi-scope convolutional neural networks for destination prediction of trajectories[J].IEEE Transactions on Intelligent Transportation Systems,2020,21(8):3184-3195. |
| 3 | BIN C, GU T, SUN Y,et al .A personalized POI route recommendation system based on heterogeneous tourism data and sequential pattern mining[J].Multimedia Tools and Applications,2019,78:35135-35156. |
| 4 | QIAN S, CHENG B, CAO J,et al .Detecting taxi trajectory anomaly based on spatio-temporal relations[J].IEEE Transactions on Intelligent Transportation Systems,2021,23(7):11-12. |
| 5 | ZHENG L J, XIA D, ZHAO X,et al .Spatial-temporal travel pattern mining using massive taxi trajectory data[J].Physica A:Statistical Mechanics and Iits Applications,2018,501:24-41. |
| 6 | 杨振娟 .基于出租车GPS轨迹的热点路後挖掘和载客路径推荐[D].兰州:西北师范大学,2020. |
| 7 | 吴俊伟,朱云龙,库涛,等 .基于网格聚类的热点路径探测[J].吉林大学学报(工学版),2015,45(1):274-282. |
| WU Jun-wei, ZHU Yun-long, KU Tao,et al .Hot routes detection algorithm based on grid clustering[J].Journal of Jilin University (Engineering and Technology Edition),2015,45(1):274-282. | |
| 8 | 段宗涛,任国亮,康军,等 .基于频繁轨迹序列模式挖掘的路径推荐方法[J].太原理工大学学报,2022,53(2):240-247. |
| DUAN Zong-tao, REN Guo-liang, KANG Jun,et al .Route recommendation method based on frequent trajectory sequence pattern mining[J].Journal of Taiyuan University of Technology,2022,53(2):240-247 | |
| 9 | 袁淑君 .基于频繁序列挖掘与双层Logit模型的出租车寻客行为建模方法[D].武汉:武汉大学,2019. |
| 10 | 孙文平,常亮,宾辰忠,等 .基于知识图谱和频繁序列挖掘的旅游路线推荐[J].计算机科学,2019,46(2):56-61. |
| SUN Wen-ping, CHANG Liang, Chen-zhong BIN,et al .Travel route recommendation based on knowledge graph and frequent sequence mining[J].Computer Science,2019,46(2):56-61. | |
| 11 | 胡冰冰,芦俊丽,郑承宇,等 .改进的 PrefixSpan 算法在旅游热门路线上的应用[J].云南民族大学学报(自然科学版),2022,31(1):94-102. |
| HU Bing-bing, LU Jun-li, ZHENG Cheng-yu,et al .Application of improved PrefixSpan agorithm on popular travel routes[J].Journal of Yunnan Nationalities University (Natural Sciences Edition),2022,31(1):94-102. | |
| 12 | 林鹏飞,翁剑成,胡松,等 .公共交通乘客个体活动链的日相似性研究[J].交通运输系统工程与信息,2020,20(6):178-183. |
| LIN Peng-fei, WENG Jian-cheng, HU Song,et al .Day-to-day similarity of individual activity chain of public transport passengers[J].Journal of Transportation Systems Engineering and Information Technology,2020,20(6):178-183. | |
| 13 | 吴刚,邱煜晶,王国仁 .基于隐马尔可夫模型和遗传算法的地图匹配算法[J].东北大学学报(自然科学版),2017,38(4):472-475. |
| WU Gang, QIU Yu-jing, WANG Guo-ren .Map matching algorithm based on hidden Markov model and genetic algorithm[J].Journal of Northeastern University (Natural Science),2017,38(4):472-475. | |
| 14 | SRIKANT R, AGRAWAL R .Mining sequential patterns:generalization sandper form anceim provements[C]∥Proceedings of the 5th International Conferenceon Extending Database Technology:Advances in Database Technology.Avignon:[s. n.],1996:3-17. |
| 15 | VLACHOS M, KOLLIOS G, GUNOPULOS D .Discovering similar multidimensional trajectories[C]∥Proceedings of the 18th International Conference on Data Engineering.Washington DC:IEEE Computer Society,2002:673-684. |
| 16 | SANKOFF D, KRUSKAL J .Time warps,string edits,and macromolecules:the theory and practice of sequence comparison[M].Boston:Addison-Wesley,1983. |
/
| 〈 |
|
〉 |