Journal of Transportation Systems Engineering and Information Technology ›› 2025, Vol. 25 ›› Issue (5): 333-342.DOI: 10.16097/j.cnki.1009-6744.2025.05.030

Previous Articles     Next Articles

Online Optimization Methods for Dynamic Lane Configuration at Highway Toll Plazas

MA Feihu*, CHEN Xiaoyan, SUN Cuiyu, TIAN Xingtong   

  1. School of Traffic and Transportation Engineering, East China Jiaotong University, Nanchang 330013, China
  • Received:2025-05-27 Revised:2025-06-24 Accepted:2025-07-01 Online:2025-10-25 Published:2025-10-25

高速公路收费站动态车道配置的在线优化方法

马飞虎*,陈晓燕,孙翠羽,田星彤   

  1. 华东交通大学,交通运输工程学院,南昌330013
  • 作者简介:马飞虎(1973—),男,甘肃金昌人,教授。

Abstract: This study addresses the issue of lane configuration optimization at highway toll plazas by proposing a dynamic lane configuration strategy based on reinforcement learning. A simulation environment for highway toll plazas is constructed based on the traffic behavior of vehicles passing through a toll plaza. The complex problem of lane configuration is transformed into a clear quantitative objective function that considers the operational costs of toll plazas, user delay, and congestion penalties. The lane resource allocation strategy of toll plazas is optimized dynamically through the training of a reinforcement learning network. The model is capable of real-time learning and dynamic adjustment of lane configurations to adapt to the dynamic changes in traffic flow and patterns. Experiments comparing the reinforcement learning optimization method with traditional offline optimization methods show that the proximal policy optimization (PPO) method reduces the average queue length by 12.45% and narrows the fluctuation range of average travel time by 26.94%. The PPO algorithm demonstrates advantages in reducing queue length and decreasing the fluctuation in travel time, especially during peak hours. The dynamic lane configuration strategy exhibits higher adaptability and flexibility, enhancing the operational efficiency of toll plazas.

Key words: highway transportation, dynamic lane configuration optimization, reinforcement learning, highway toll plaza; proximal policy optimization (PPO) algorithm

摘要: 针对高速公路收费站车道配置优化问题,本文提出一种基于强化学习的动态车道配置策略。根据车辆通过收费站的交通行为构建高速公路收费站仿真环境,将复杂的车道配置问题转化为考虑收费站运营成本、用户延误和拥堵惩罚的明确量化指标的目标函数,借助强化学习网络训练,动态优化收费站车道资源配置策略。模型能够实时学习并动态调整车道配置,以应对交通流量和模式的动态变化。实验对比了强化学习优化方法与传统离线优化方法,结果表明,PPO(Proximal Policy Optimization)方法在全程平均排队数上降低了12.45%,在平均通过时间的波动范围上缩小了26.94%,PPO算法在减少排队长度和降低通行时间波动方面具有优势,特别是在高峰时段动态车道配置策略展现出更高的适应性和灵活性,提升了收费站的运营效率。

关键词: 公路运输, 车道配置动态优化, 强化学习, 高速公路收费站, PPO算法

CLC Number: