交通运输系统工程与信息 ›› 2026, Vol. 26 ›› Issue (3): 203-213.DOI: 10.16097/j.cnki.1009-6744.2026.03.019

• 智能交通系统与信息技术 • 上一篇    下一篇

面向多目标协调的细粒度交通信号智能体建模与控制方法

陈予禾1,2a ,徐新忠3 ,姜雯文1 ,阙恒荣2b ,王屏*1   

  1. 1. 同济大学,道路与交通工程教育部重点实验室,上海201804;2.东南大学,a.交通学院, b. 网络空间安全学院,南京211102;3.上海市道路运输事业发展中心,上海200023
  • 收稿日期:2026-02-04 修回日期:2026-03-23 接受日期:2026-03-31 出版日期:2026-06-25 发布日期:2026-06-23
  • 作者简介:陈予禾(2000—),女,海南海口人,博士生。
  • 基金资助:
    上海市交通委员会科研项目 (JT2024-KY-004);宁波市科学技术局项目“科创甬江2035”重大应用示范计划 (2025Z192)。

Fine-Grained Traffic Signal Agent Modeling and Control Method for Multi-objective Coordination

CHEN Yuhe1,2a, XU Xinzhong3, JIANG Wenwen1, QUE Hengrong2b, WANG Ping*1   

  1. 1. The Key Laboratory of Road and Traffic Engineering, Ministry of Education, Tongji University, Shanghai 201804, China; 2a. School of Transportation, 2b. School of Cyber Science and Engineering, Southeast University, Nanjing 211102, China; 3. Shanghai Municipal Center for the Development of Road Transport Services, Shanghai 200023, China
  • Received:2026-02-04 Revised:2026-03-23 Accepted:2026-03-31 Online:2026-06-25 Published:2026-06-23
  • Supported by:
    Shanghai Municipal Commission of Transportation Project (JT2024-KY-004);Ningbo Municipal Bureau of Science and Technology Project: "Science and Technology Innovation Yongjiang 2035" Major Application Demonstration Program (2025Z192)。

摘要: 为解决城市交通信号控制难以兼顾通行效率、公交优先与环境影响等多目标问题,本文提出一种基于图注意力机制的交通信号智能体建模方法。首先,在路网空间建模方面,构建以“进口道”为图节点的交通路网空间建模方法,并进一步引入图注意力机制来描述路段间的空间依赖关系,增强智能体对上游交通实况的感知能力。其次,在交通流量时序建模方面,比较多层感知机与长短时记忆网络两种策略网络。然后,在多目标优化任务方面,设计根据道路实况数据进行权重自适应动态调整的奖励函数,实现通行效率、公交优先与环境影响的多目标协同优化。最后,选取一个三纵三横,边长为1.2km正方形结构的城市交通信号网格作为实验路网,对交通信号智能体进行训练,并在不同流量及波动场景下进行测试。结果表明,本文提出的交通信号智能体在各目标上均显著优于简单自适应控制。通行效率方面,在高流量下理想车速达成度提升36.21%;环境影响方面,燃油效率提升超过11.0%;公交优先方面,模型使公交通行能力提高约25.0%,车速接近性提升14.5%。因此,本文构建的多目标交通信号智能体模型具备良好的多目标协调能力,为构建高效、公平、环保的智能交通信号控制提供了可行路径。

关键词: 城市交通, 图注意力机制, 强化学习, 交通信号控制, 公交优先, 环境影响

Abstract: To address the multiple objectives challenge of urban traffic signal control that is difficult to simultaneously balance the traffic efficiency, public transport priority, and environmental impact, this paper proposes a modeling method for traffic signal agent based on a graph attention mechanism. First, in terms of road network spatial modeling, an innovative approach centered on "approach road segments" is constructed, and a graph attention mechanism is further introduced to characterize the spatial dependency relationships among road segments. Thereby it enhances the perception of agent in upstream traffic conditions. Second, in traffic flow temporal modeling, this paper compares two types of network architectures: a multilayer perceptron and a long short-term memory network. Third, for the multi-objective optimization task, a reward function with adaptive and dynamic weight adjustment based on real-time road conditions is designed, enabling the coordinated optimization of traffic efficiency, public transport priority, and environmental impact. Finally, a square urban traffic signal grid consisting of three north-south and three east-west corridors with a side length of 1.2 km is selected as the experimental road network, on which the traffic signal agent is trained and tested under different traffic volumes and fluctuation scenarios. The results show that the proposed traffic signal agent significantly outperforms a simple adaptive control across all objectives: in terms of traffic efficiency, the achievement degree of ideal vehicle speed under high traffic demand is improved by 36.21%; in terms of environmental impact, fuel efficiency is improved by more than 11.0%; and in terms of public transport priority, the model increases the number of buses passing through by approximately 25.0%, with bus speed closeness improved by 14.5%. Therefore, the agent model of multi-objective traffic signal developed in this paper demonstrates a strong multi-objective coordination capability and provides a feasible pathway for building efficient, equitable, and environmentally friendly intelligent traffic signal control systems.

Key words: urban transportation, graph attention mechanism, reinforcement learning, traffic signal control, bus priority, environment impact

中图分类号: