城市交叉口Agent间的多遇交互历史学习协调方法

doi:10.3969/j.issn.1000-565X.2011.03.022

华南理工大学学报（自然科学版） ›› 2011, Vol. 39 ›› Issue (3): 114-119.doi: 10.3969/j.issn.1000-565X.2011.03.022

城市交叉口Agent间的多遇交互历史学习协调方法

夏新海许伦辉

华南理工大学土木与交通学院，广东广州 510640

收稿日期:2010-05-21 修回日期:2010-10-06 出版日期:2011-03-25 发布日期:2011-02-01
通信作者: 夏新海(1978-)，男，博士生，广州航海高等专科学校讲师，主要从事交通运输研究 E-mail:xiaxinhai@126.com
作者简介:夏新海(1978-)，男，博士生，广州航海高等专科学校讲师，主要从事交通运输研究
基金资助:
国家自然科学基金资助项目(60664001)

A Multi-Interaction History Learning Approach for Coordination of Urban Intersection Agents

Xia Xin-hai Xu Lun-hui

South China university of technology of civil and traffic institute, guangdong guangzhou 510640

Received:2010-05-21 Revised:2010-10-06 Online:2011-03-25 Published:2011-02-01
Contact: 夏新海(1978-)，男，博士生，广州航海高等专科学校讲师，主要从事交通运输研究 E-mail:xiaxinhai@126.com
About author:夏新海(1978-)，男，博士生，广州航海高等专科学校讲师，主要从事交通运输研究
Supported by:
国家自然科学基金资助项目(60664001)

摘要/Abstract

摘要： 为信号控制的城市道路交叉口定义一个Agent结构模型,利用双人对策Nash平衡理论构建了城市交叉口Agent间的多遇交互模型,每一交叉口Agent与相邻交叉口Agent进行多次交互学习,根据选择策略获得的效用值来更新它的混合策略.利用记忆因子δ、学习概率α、交叉口交通流变化概率βi等参数分析了交叉口Agent间的循环学习协调过程.设计了交叉口Agent多遇交互历史学习协调算法,在此算法里交叉口Agent可以通过对其他相邻交叉口Agent以往历史交互行为特别是最近的历史行为的记忆学习达到协调.以数个交叉口相连接的干道为例分析了δ、α、βi等参数对算法性能的影响.通过干道上交叉口交通信号协调的实例分析,证明了该协调学习方法的有效性.

关键词: Agent, 交通信号控制, 学习, 协调

Abstract:

Proposed in this paper is a multi-interaction history learning approach for the coordination of urban intersection agents.In the investigation,first,each signalized intersection is defined with an Agent controller.Next,a multi-interaction model for urban intersection Agents is built based on the two-person Nash equilibrium game theory to make each intersection Agent to perform multi-interaction learning with its neighbours and to update its mixed strategy according to the utility value of the selected strategy.Then,the iterative interaction learning process of intersection Agents is analyzed by using the parameters such as memory factor δ,learning probability α and local traffic change probability βi at each intersection.A multi-interactive history learning algorithm was constructed.In the proposed algorithm,intersection Agents coordinate by taking into consi-deration all history interactive information（especially the recent one） coming from neighbouring intersection Agents.Finally the effects of parameters δ,α and βi on the algorithm performance is also analyzed by the experiment of traffic signal control at some connected intersections.The results show that the proposed coordinative learning approach is effective.

Key words: Agent, traffic signal control, learning, coordination

夏新海许伦辉. 城市交叉口Agent间的多遇交互历史学习协调方法[J]. 华南理工大学学报（自然科学版）, 2011, 39(3): 114-119.

Xia Xin-hai Xu Lun-hui. A Multi-Interaction History Learning Approach for Coordination of Urban Intersection Agents[J]. Journal of South China University of Technology (Natural Science Edition), 2011, 39(3): 114-119.

[1]	雷宇, 黄亦凡, 罗学东, 等. 基于NMI-FA-DELM模型的土壤热导率预测[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 129-138.
[2]	李方, 郭炜森, 张平, 等. 基于时空双细胞状态的轴承剩余使用寿命预测方法[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 69-81.
[3]	苏锦钿, 余珊珊, 洪晓斌. 一种面向中文拼写纠错的自监督预训练方法[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 90-98.
[4]	李家春, 李博文, 林伟伟. AdfNet：一种基于多样化特征的自适应深度伪造检测网络[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 82-89.
[5]	王福建, 程慧玲, 马东方, 等. 基于深度逆向强化学习的城市车辆路径链重构[J]. 华南理工大学学报(自然科学版), 2023, 51(7): 120-128.
[6]	郭恩强, 符锌砂. 基于特征相似性学习的抛洒物检测方法[J]. 华南理工大学学报(自然科学版), 2023, 51(6): 30-41.
[7]	赵建东, 焦岚馨, 赵志敏, 等. 考虑侧向车换道影响的理论和数据组合驱动的车辆跟驰模型[J]. 华南理工大学学报(自然科学版), 2023, 51(6): 10-19.
[8]	叶峰, 陈彪, 赖乙宗. 基于特征空间嵌入的对比知识蒸馏算法[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 13-23.
[9]	刘怡俊, 曹宇, 叶武剑, 等. 基于FPGA并行加速的脉冲神经网络在线学习硬件结构的设计与实现[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 104-113.
[10]	周楚昊, 林培群, 闫明月. 基于自监督学习的交通数据补全算法[J]. 华南理工大学学报(自然科学版), 2023, 51(4): 101-114.
[11]	陈锋, 毛豪滨, 蔡吉玲, 等. 面向低延时实时视频的多维跨层带宽预测[J]. 华南理工大学学报(自然科学版), 2023, 51(11): 18-27.
[12]	许伦辉, 余佳芯, 裴明阳, 等. 基于几何路网结构和强化学习的车辆重定位策略[J]. 华南理工大学学报(自然科学版), 2023, 51(10): 99-109.
[13]	王昊, 谢凝. 考虑有轨电车效率的干线分段绿波协调优化模型[J]. 华南理工大学学报(自然科学版), 2023, 51(1): 95-105.
[14]	王高, 陈晓鸿, 柳宁, 等. 一种基于视角选择经验增强算法的机器人抓取策略[J]. 华南理工大学学报(自然科学版), 2022, 50(9): 126-137.
[15]	卢凯, 赵世杰, 吴焕, 等. 饱和交叉口的双向红绿波协调设计数解算法[J]. 华南理工大学学报(自然科学版), 2022, 50(9): 1-11.

城市交叉口Agent间的多遇交互历史学习协调方法

A Multi-Interaction History Learning Approach for Coordination of Urban Intersection Agents

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价