Journal of Transportation Systems Engineering and Information Technology ›› 2023, Vol. 23 ›› Issue (1): 77-85.DOI: 10.16097/j.cnki.1009-6744.2023

Previous Articles     Next Articles

A Combination Model for Connected and Autonomous Vehicles Lane-changing Decision-making Under Multi Connectivity Range

ZHAO Jian-dong*1a,1b, HE Xiao-yu1a, YU Zhi-xin1a, HAN Ming-min2   

  1. 1a. School of Traffic and Transportation, 1b. Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, Beijing Jiaotong University, Beijing 100044, China; 2. Hebei Provincial Communications Planning, Design and Research Institute Co. Ltd., Shijiazhuang 050000, China
  • Received:2022-10-31 Revised:2022-12-07 Accepted:2022-12-19 Online:2023-02-25 Published:2023-02-16
  • Supported by:
    National Key Research and Development Program (2019YFB1600200); National Natural Science Foundation of China (71931002,71871011)

多网联范围下的智能网联车换道决策组合模型研究

赵建东*1a,1b,贺晓宇1a,余智鑫1a,韩明敏2   

  1. 1.北京交通大学,a.交通运输学院,b.综合交通运输大数据应用技术交通运输行业重点实验室,北京100044; 2.河北省交通规划设计研究院有限公司,石家庄 050000
  • 作者简介:赵建东(1975- ),男,山西忻州人,教授,博士。
  • 基金资助:
    国家重点研发计划(2019YFB1600200);国家自然科学基金(71931002,71871011)

Abstract: In order to improve the lane-changing efficiency of intelligent connected vehicles (ICV) under different network connection ranges, combined with deep reinforcement learning and molecular dynamics theory, a double deep Q network lane-changing decision model integrating the masking mechanism and attention mechanism (MAQ) is proposed. Firstly, in the Simulation of Urban Mobility (SUMO) simulation environment, the driving status information of connected vehicles and human drive vehicles (HDV) within the network range is collected. Secondly, the MAQ model is built, the mask mechanism and attention mechanism are adopted to achieve fixed model input size and displacement invariance. Thirdly, in order to quantify the degree of influence between vehicles, the relative speed and the relative position between vehicles are used as parameters, and the molecular dynamics theory is used to give weights to HDV information within the connectivity range. Finally, different lane- changing decision models and weighting methods are compared in different traffic density simulation environments. The effect of lane change decision is tested under different connectivity ranges (80~330 meters, with an interval of 50 meters). The simulation results show that, taking 40 HDVs and a 100-meter connectivity range as an example, the MAQ model has a 90.2% improvement in fitting accuracy compared with the DeepSet-Q model; compared with the linear weighting method, the molecular dynamics weighting method increases the total reward value by 5.5% , and the average speed of ICV by 4.4%; with the expansion of the connectivity range, the average speed of ICV shows a change rule of first increasing, then decreasing, and then tending to be stable.

Key words: intelligent transportation, intelligent connected vehicles, lane change decision, deep reinforcement learning, molecular dynamics, connectivity range

摘要: 为提升不同网联范围下智能网联车(Intelligent Connected Vehicles, ICV)的换道效率,结合深度强化学习和分子动力学理论,提出一种融合掩码机制和注意力机制的双深度Q网络(MaskAttention-DDQN, MAQ)换道决策模型。首先,在SUMO (Simulation of Urban Mobility)仿真环境中采集网联范围内ICV及人工驾驶车辆(Human Drive Vehicles, HDV)的行驶状态信息。其次,搭建MAQ模型,采用掩码机制和注意力机制方法,实现固定模型输入大小,以及实现置换不变性。第三,为实现车辆间影响程度的数值化,以车辆间相对速度和相对位置为参数,使用分子动力学理论为网联范围内HDV信息赋予权重。最后,分别在不同交通密度仿真环境中对不同换道决策模型和赋权方法进行对比,并测试ICV在不同网联范围(80~330 m,以50 m为间隔)下的换道决策效果。仿真结果表明,以40辆HDV、100m网联范围为例,MAQ模型比DeepSet-Q模型拟合精度提高了90.2%;分子动力学赋权方法相比线性权重赋权方法总奖励值提高了5.5%,ICV平均车速提高了4.8%;ICV平均车速随着网联范围的扩大,呈现出先增大、再减小、后趋于平稳的变化规律。

关键词: 智能交通, 智能网联车, 换道决策, 深度强化学习, 分子动力学, 网联范围

CLC Number: