华南理工大学学报(自然科学版)

• 计算机科学与技术 • 上一篇    下一篇

结合时空图卷积和大语言模型的交通流预测方法

王书海1,2  李宁1   潘晓1   任洺萱3   

  1. 1.石家庄铁道大学 信息科学与技术学院,河北 石家庄 050043;

    2.石家庄市人工智能重点实验室,河北 石家庄 050043;

    3.石家庄铁道大学 交通运输学院,河北 石家庄 050043

  • 发布日期:2026-02-27

Traffic Flow Prediction Method Combining Spatio-Temporal Graph Convolution and Large Language Models

WANG Shuhai1,2  LI Ning1  PAN Xiao1  REN Mingxuan3   

  1. 1.School of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang 050043, China;

    2.Shijiazhuang Key Laboratory of Artificial Intelligence, Shijiazhuang 050043, China;

    3.School of Traffic and Transportation, Shijiazhuang Tiedao University, Shijiazhuang 050043, China

  • Published:2026-02-27

摘要:

交通流预测是智能交通系统(Intelligent Transportation System,ITS)的重要组成部分。当前交通流量预测仍存在以下局限:对交通流量预测过程中空间相关性的关注不足,主要面向个别尺度模式下的交通流量预测,且较少考虑天气等外部因素对交通流量预测的干扰。针对上述问题,本研究提出了一种结合多尺度时空图卷积和大语言模型的交通流量预测方法。该方法首先对交通数据进行趋势序列的划分,设计了包括空间和时间维度上的注意力模块和卷积模块的时空嵌入层,用以提取交通数据的空间和时间特征。将提取到的时空特征输入到带有动态冻结部分注意力机制的大语言模型(Large Language Model,LLM)中,并应用参数高效微调(Parameter-Efficient Fine-Tuning, PEFT) 技术,在降低计算资源消耗的同时,使模型可以更好地适应具有动态时空相关性的交通流量预测任务。此外,考虑到交通数据的非平稳性,在外部使用多层感知机模块增强对交通流量的预测。最后,在两个真实的数据集PEMS04和PEMS08上进行模型的有效性验证,结果表明,提出模型的预测误差精度指标平均MAE与次优模型OFA对比,提高了1.05%和1.87%。

关键词: 智慧交通系统, 大语言模型, 时空图卷积, 参数高效微调, 交通流量预测

Abstract:

Traffic flow prediction is an important component of the Intelligent Transportation System (ITS). Currently, traffic flow prediction still has the following limitations: insufficient attention to the spatial correlation in the traffic flow prediction process, mainly focusing on traffic flow prediction at individual scale patterns, and rarely considering the interference of external factors such as weather on traffic flow prediction. To address these issues, this study proposes a traffic flow prediction method that combines multi-scale spatiotemporal graph convolution and large language models. This method first divides the traffic data into trend sequences, designs a spatiotemporal embedding layer including attention modules and convolutional modules in the spatial and temporal dimensions to extract the spatial and temporal features of the traffic data. The extracted spatiotemporal features are input into a large language model (LLM) with a dynamically frozen partial attention mechanism, and the parameter-efficient fine-tuning (PEFT) technique is applied to make the model better adapt to traffic flow prediction tasks with dynamic spatiotemporal correlations. Additionally, considering the non-stationarity of traffic data, a multi-layer perceptron module is used externally to enhance the prediction of traffic flow. Finally, the effectiveness of the model is verified on two real datasets, PEMS04 and PEMS08. The results show that the average MAE of the prediction error accuracy index of the proposed model is 1.05% and 1.87% higher than that of the suboptimal model OFA.

Key words: intelligent transportation systems, large language models, spatio-temporal graph convolution, parameter-efficient fine-tuning, traffic flow prediction