华南理工大学学报(自然科学版) ›› 2011, Vol. 39 ›› Issue (3): 114-119.doi: 10.3969/j.issn.1000-565X.2011.03.022

• 交通与运输工程 • 上一篇    下一篇

城市交叉口Agent间的多遇交互历史学习协调方法

夏新海 许伦辉   

  1. 华南理工大学 土木与交通学院,广东 广州 510640
  • 收稿日期:2010-05-21 修回日期:2010-10-06 出版日期:2011-03-25 发布日期:2011-02-01
  • 通信作者: 夏新海(1978-),男,博士生,广州航海高等专科学校讲师,主要从事交通运输研究 E-mail:xiaxinhai@126.com
  • 作者简介:夏新海(1978-),男,博士生,广州航海高等专科学校讲师,主要从事交通运输研究
  • 基金资助:

    国家自然科学基金资助项目(60664001)

A Multi-Interaction History Learning Approach for Coordination of Urban Intersection Agents

Xia Xin-hai  Xu Lun-hui   

  1. South China university of technology of civil and traffic institute, guangdong guangzhou 510640
  • Received:2010-05-21 Revised:2010-10-06 Online:2011-03-25 Published:2011-02-01
  • Contact: 夏新海(1978-),男,博士生,广州航海高等专科学校讲师,主要从事交通运输研究 E-mail:xiaxinhai@126.com
  • About author:夏新海(1978-),男,博士生,广州航海高等专科学校讲师,主要从事交通运输研究
  • Supported by:

    国家自然科学基金资助项目(60664001)

摘要: 为信号控制的城市道路交叉口定义一个Agent结构模型,利用双人对策Nash平衡理论构建了城市交叉口Agent间的多遇交互模型,每一交叉口Agent与相邻交叉口Agent进行多次交互学习,根据选择策略获得的效用值来更新它的混合策略.利用记忆因子δ、学习概率α、交叉口交通流变化概率βi等参数分析了交叉口Agent间的循环学习协调过程.设计了交叉口Agent多遇交互历史学习协调算法,在此算法里交叉口Agent可以通过对其他相邻交叉口Agent以往历史交互行为特别是最近的历史行为的记忆学习达到协调.以数个交叉口相连接的干道为例分析了δ、α、βi等参数对算法性能的影响.通过干道上交叉口交通信号协调的实例分析,证明了该协调学习方法的有效性.

关键词: Agent, 交通信号控制, 学习, 协调

Abstract:

Proposed in this paper is a multi-interaction history learning approach for the coordination of urban intersection agents.In the investigation,first,each signalized intersection is defined with an Agent controller.Next,a multi-interaction model for urban intersection Agents is built based on the two-person Nash equilibrium game theory to make each intersection Agent to perform multi-interaction learning with its neighbours and to update its mixed strategy according to the utility value of the selected strategy.Then,the iterative interaction learning process of intersection Agents is analyzed by using the parameters such as memory factor δ,learning probability α and local traffic change probability βi at each intersection.A multi-interactive history learning algorithm was constructed.In the proposed algorithm,intersection Agents coordinate by taking into consi-deration all history interactive information(especially the recent one) coming from neighbouring intersection Agents.Finally the effects of parameters δ,α and βi on the algorithm performance is also analyzed by the experiment of traffic signal control at some connected intersections.The results show that the proposed coordinative learning approach is effective.

Key words: Agent, traffic signal control, learning, coordination