Journal of South China University of Technology (Natural Science Edition)

• Special Topic: Digital and Intelligent Transportation •


Adaptive Traffic Signal Control Method Based on a Multi-layer Heterogeneous Distillation Graph Neural Network

CHEN Yuguang1,2, HAI Lingtao1, ZHANG Shun1, GAO Jiayao1, GUO Fengxiang1

  1. Faculty of Transportation Engineering, Kunming University of Science and Technology, Kunming 650500, Yunnan, China;

    2. School of Transportation, Southeast University, Nanjing 210096, Jiangsu, China

  • Published: 2026-01-23


Abstract:

Deep reinforcement learning (DRL) has been widely applied to adaptive traffic signal control (ATSC), but existing algorithms cannot capture a comprehensive intersection state and neglect the influence of complex traffic flow composition on signal control performance. This paper proposes a deep learning algorithm based on a knowledge-distillation heterogeneous graph neural network (KAHGN-Q), which extracts traffic flow information from every approach of the target intersection and from the influencing approaches of adjacent intersections, yielding a complete and comprehensive intersection state representation. A new graph neural network input architecture is built that models traffic flow at three levels, dividing nodes into direction-level, lane-type-level, and vehicle-type-level nodes, thereby coupling macroscopic traffic flow with microscopic vehicle composition characteristics. D3QN reinforcement learning with prioritized experience replay incorporating dynamic priorities (PERDP) is employed to continuously learn the optimal action-selection policy while ensuring full coverage of all candidate policies. In the reward design, different weights are assigned to different vehicle types, which can realize bus priority. Experimental results show that the KAHGN-Q algorithm has advantages in reducing the average waiting time and average delay of vehicles.
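The abstract does not give the paper's formulas, so the two most concrete ideas — a vehicle-type-weighted reward that can realize bus priority, and experience replay whose priority influence changes dynamically so that all stored strategies retain a chance of being replayed — can only be illustrated as a minimal sketch. All names, weight values, and the decay schedule below are hypothetical assumptions, not the authors' method:

```python
import random

# Hypothetical vehicle-type weights (not given in the abstract): buses are
# weighted more heavily, so reducing their waiting time contributes more
# to the reward -- one way a weighted reward can realize bus priority.
VEHICLE_WEIGHTS = {"bus": 3.0, "car": 1.0, "truck": 1.5}

def weighted_reward(waiting_times):
    """Negative weighted sum of per-vehicle waiting times.

    `waiting_times` maps vehicle type -> list of waiting times in seconds.
    """
    return -sum(
        VEHICLE_WEIGHTS.get(vtype, 1.0) * sum(times)
        for vtype, times in waiting_times.items()
    )

class DynamicPriorityReplay:
    """Toy prioritized replay with a dynamically decaying priority exponent.

    Early in training, sampling is driven almost purely by TD-error
    priority; as the exponent decays toward zero, sampling approaches
    uniform, so every stored transition keeps a nonzero replay chance.
    This mimics the spirit (not the specifics) of dynamic priorities.
    """

    def __init__(self, alpha_start=1.0, alpha_end=0.0, decay_steps=10000):
        self.buffer, self.priorities = [], []
        self.alpha_start, self.alpha_end = alpha_start, alpha_end
        self.decay_steps, self.step = decay_steps, 0

    def add(self, transition, td_error):
        # Small epsilon keeps zero-error transitions sampleable.
        self.buffer.append(transition)
        self.priorities.append(abs(td_error) + 1e-6)

    def sample(self, k):
        # Linearly anneal the priority exponent alpha toward alpha_end.
        frac = min(self.step / self.decay_steps, 1.0)
        alpha = self.alpha_start + (self.alpha_end - self.alpha_start) * frac
        self.step += 1
        weights = [p ** alpha for p in self.priorities]
        return random.choices(self.buffer, weights=weights, k=k)
```

In a full D3QN agent, `weighted_reward` would be computed from simulator observations each control step, and minibatches drawn from the replay buffer would train the dueling double Q-network.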

Key words: urban transportation, traffic signal control, deep reinforcement learning, heterogeneous graph neural network, knowledge distillation