Journal of South China University of Technology(Natural Science Edition) ›› 2025, Vol. 53 ›› Issue (5): 82-93.doi: 10.12141/j.issn.1000-565X.240480

• Computer Science & Technology • Previous Articles     Next Articles

A Spatiotemporal Heterogeneous Two-Stage Fusion Network for Traffic Flow Prediction

HOU Yue(), YIN Jie, ZHANG Zhihao, LU Keke   

  1. School of Electronics and Information Engineering,Lanzhou Jiaotong University,Lanzhou 730070,Gansu,China
  • Received:2024-09-26 Online:2025-05-25 Published:2024-12-04
  • Supported by:
    the National Natural Science Foundation of China(62063014);the Natural Science Foundation of Gansu Province(22JR5RA365)

Abstract:

In response to the existing traffic flow prediction studies that fail to fully integrate complex spatiotemporal correlations and heterogeneities, this paper designs a traffic flow prediction network based on grid data, namely the spatiotemporal heterogeneous two-stage fusion neural network marked as ST_HTFNN. This network employs a phased and hierarchical spatiotemporal feature extraction architecture, and adopts a new model where the static and dynamic feature extraction stages are serialized. In the static feature extraction stage, a novel Mamba-like linear attention (MLLA) block is introduced as a static heterogeneous fusion unit to achieve spatial correlation and heterogeneity fusion mining. In the dynamic feature extraction stage, a simple and efficient dynamic heterogeneous fusion unit is designed, and dilated convolution is combined with gating mechanisms to adaptively fuse and capture global and local spatiotemporal correlations and heterogeneities. Furthermore, to address the smoothing of road features during the deep convolution process for road-level traffic flow characteristics, a road feature enhancement module is designed to reconstruct and enhance road information. Finally, an external disturbance feature fusion module is designed to integrate the impact of external disturbance features on traffic flow prediction results. Experimental results on three real-world traffic datasets, namely BikeNYC, TaxiCQ and TaxiBJ, demonstrate that the ST_HTFNN model outperforms the existing benchmark methods, respectively with a decrease of 6.13%, 0.8% and 7.01% in the mean absolute error of prediction accuracy.

Key words: traffic flow prediction, grid data, spatiotemporal heterogeneity, dilated convolution, gating mechanism

CLC Number: