[1] 徐东伟, 周磊, 王达, 等. 基于深度强化学习的城市交通信号控制综述[J]. 交通运输工程与信息学报, 2022,
20(1): 16-37. [XU D W, ZHOU L, WANG D, et al.
Overview of reinforcement learning-based urban traffic
signal control[J]. Journal of Transportation Engineering
and Information, 2022, 20(1): 16-37.]
[2] WEI H, ZHENG G J, YAO H X, et al. IntelliLight:
Areinforcement learning approach for intelligent traffic
light control[C]. London: Proceedings of the 24th ACM
SIGKDD International Conference on Knowledge
Discovery & Data Mining, 2018.
[3] MAO F, LI Z H, LI L. A comparison of deep
reinforcement learning models for isolated traffic signal
control[J/OL]. IEEE Intelligent Transportation Systems
Magazine. (2022-02-14) [2022-08-04]. https://doi.org/
10.1109/MITS.2022.3144797.
[4] LI L, LV Y S, WANG F Y. Traffic signal timing via deep
reinforcement learning[J]. IEEE/CAA Journal of
Automatica Sinica, 2016, 3(3): 247-254.
[5] LIANG X Y, DU X S, WANG G L, et al. A deep
reinforcement learning network for traffic light cycle
control[J]. IEEE Transactions on Vehicular Technology,
2019, 68(2): 1243-1253.
[6] XU M, WU J P, HUANG L, et al. Network-wide traffic
signal control based on the discovery of critical nodes
and deep reinforcement learning[J]. Journal of Intelligent
Transportation Systems, 2020, 24(1): 1-10.
[7] LI C H, MA X T, XIA L, et al. Fairness control of traffic
light via deep reinforcement learning[C]. Electronic
Network: 2020 IEEE 16th International Conference on
Automation Science and Engineering (CASE), 2020.
[8] YU M R, CHAI J J, LV Y S, et al. An effective deep
reinforcement learning approach for adaptive traffic
signal control[C]. Shanghai: 2020 Chinese Automation
Congress, 2020.
[9] 马东方, 陈曦, 吴晓东, 等. 基于强化学习的干线信号混合协同优化方法[J]. 交通运输系统工程与信息,
2022, 22(2): 145-153. [MA D F, CHEN X, WU X D,
et al. Mixed- coordinated decision-making method for
arterial signals based on reinforcement learning[J].
Journal of Transportation Systems Engineering and
Information Technology, 2022, 22(2): 145-153.]
[10] ZHENG G J, XIONG Y H, ZANG X S, et al. Learning
phase competition for traffic signal control[C]. New York:
Proceedings of the 28th ACM International Conference
on Information and Knowledge Management, 2019.
[11] FAN Z, SU R, ZHANG W N, et al. Hybrid actor-critic
reinforcement learning in parameterized Action space[C].
Macao: Proceedings of the 28th International Joint
Conference on Artificial Intelligence, 2019.
[12] SCHULMAN J, WOLSKI F, DHARIWAL P, et al.
Proximal policy optimization algorithms[J]. ArXiv
Preprint ArXiv:1707.06347, 2017.
[13] YE D H, LIUZ, SUNM F, et al. Mastering complex
control in MOBA games with deep reinforcement learning
[C]. New York: 34th AAAI Conference on Artificial
Intelligence, 2020.
[14] SCHULMAN J, MORITZ P, LEVINE S, et al. Highdimensional continuous controlusing generalized
advantage estimation[J]. ArXiv Preprint ArXiv:
1506.02438, 2015.
[15] ZHANG H C, FENG S Y, LIU C, et al. CityFlow: A multiagent reinforcement learning environment for large scale
city traffic scenario[C]. San Francisco: Proceedings of the
World Wide Web Conference (WWW 2019), 2019.
[16] ZHANG G H, WANG Y H. Optimizing minimum and
maximum green time settings for traffic actuated control
at isolated intersections[J]. IEEE Transactions on
Intelligent Transportation Systems, 2011,12(1): 164-173.
[17] COOLS S B, GERSHENSON C, D' HOOGHE B. Selforganizing traffic lights: A realistic simulation[M]//
Advances in Applied Self-Organizing Systems, London:
Springer, 2013: 45-55.
|