[1] ROBERTSON D I. "TRANSYT" method for area traffic
control[J]. Traffic Engineering & Control, 1969, 10(6):
181-182.
[2] ROBERTSON D I, BRETHERTON R D. Optimizing
networks of traffic signals in real time-the SCOOT method
[J]. IEEE Transactions on Vehicular Technology, 1991,
40(1): 11-15.
[3] SIMS A G, DOBINSON K W. The sydney coordinated
adaptive traffic (SCAT) system philosophy and benefits
[J]. IEEE Transactions on Vehicular Technology, 1980,
29(2): 130-137.
[4] VARAIYA P. Max pressure control of a network of
signalized intersections[J]. Transportation Research Part
C: Emerging Technologies, 2013, 36: 177-195.
[5] 马东方, 陈曦, 吴晓东, 等. 基于强化学习的干线信号混合协同优化方法[J]. 交通运输系统工程与信息,
2022, 22(2): 145-153. [MA D F, CHEN X, WU X D,
et al. Mixed-coordinated decision-making method for
arterial signals based on reinforcement learning[J].
Journal of Transportation Systems Engineering and
Information Technology, 2022, 22(2): 145-153.]
[6] WEI H, CHEN C, ZHENG G, et al. Presslight: Learning
max pressure control to coordinate traffic signals in
arterial network[C]. Anchorage: Proceedings of the 25th
ACM SIGKDD International Conference on Knowledge
Discovery & Data Mining, 2019.
[7] ZHENG G J, XIONG Y H, ZANG X S, et al. Learning
phase competition for traffic signal control[C]. Beijing:
Proceedings of the 28th ACM International Conference
on Information and Knowledge Management, 2019.
[8] CHEN C C, WEI H, XU N, et al. Toward a thousand
lights: Decentralized deep reinforcement learning for
large-scale traffic signal control[C]. New York:
Proceedings of the AAAI Conference on Artificial
Intelligence, 2020.
[9] PAPOUDAKIS G, CHRISTIANOS F, RAHMAN A, et al.
Dealing with non-stationarity in multi-agent deep
reinforcement learning[J]. ArXiv Preprint ArXiv, 2019,
1906: 04737.
[10] WEI H, XU N, ZHANG H C, et al. CoLight: Learning
network-level cooperation for traffic signal control[C].
Beijing: Proceedings of the 28th ACM International
Conference on Information and Knowledge Management,
2019.
[11] ZHANG L, WU Q, SHEN J, et al. Expression might be
enough: Representing pressure and demand for
reinforcement learning based traffic signal control[C].
Maryland: Proceedings of the 39th International Conference on Machine Learning, 2022.
[12] MAO F, LI Z H, LI L. A comparison of deep
reinforcement learning models for isolated traffic signal
control[J]. IEEE Intelligent Transportation Systems
Magazine, 2022, 15(1): 160-180.
[13] XU M, WU J P, HUANG L, et al. Network-wide traffic
signal control based on the discovery of critical nodes
and deep reinforcement learning[J]. Journal of Intelligent
Transportation Systems, 2020, 24(1): 1-10.
[14] WEN M N, KUBA J G, LIN R J, et al. Multi-agent
reinforcement learning is a sequence modeling problem
[J]. Advances in Neural Information Processing Systems,
2022, 35: 16509-16521.
[15] SCHULMAN J, WOLSKI F, DHARIWAL P, et al.
Proximal policy optimization algorithms[J]. ArXiv
Preprint ArXiv, 2017, 1707: 06347.
[16] KUBA J G, WEN M N, MENG L H, et al. Settling the
variance of multi-agent policy gradients[J]. Advances in
Neural Information Processing Systems, 2021, 34:
13458-13470.
[17] SCHULMAN J, MORITZ P, LEVINE S, et al. High-dimensional continuous control using generalized
advantage estimation[J]. ArXiv Preprint ArXiv, 2015,
1506: 02438.
[18] ZHANG H C, FENG S Y, LIU C, et al. CityFlow: A multi-agent reinforcement learning environment for large scale
city traffic scenario[C]. San Francisco: Proceedings of the
World Wide Web Conference (WWW 2019), 2019.
[19] KOONCE P, RODEGERDTS L, LEE K, et al. Traffic
signal timing manual[R]. Washington: Federal Highway
Administration, 2008.
[20] COOLS S B, GERSHENSON C, D'HOOGHE B. Self-organizing traffic lights: A realistic simulation[M].
London: Advances in Applied Self-Organizing Systems,
2013.
[21] OROOJLOOY A, NAZARI M, HAJINEZHAD D, et al.
Attendlight: Universal attention-based reinforcement
learning model for traffic signal control[J]. Advances in
Neural Information Processing Systems, 2020, 33: 4079-
4090.
[22] ZHANG G H, WANG Y H. Optimizing minimum and
maximum green time settings for traffic actuated control
at isolated intersections[J]. IEEE Transactions on
Intelligent Transportation Systems, 2011, 12(1): 164-
173.
|