TY - JOUR
T1 - Traffic signal optimization for multiple intersections based on reinforcement learning
AU - Gu, Jaun
AU - Lee, Minhyuck
AU - Jun, Chulmin
AU - Han, Yohee
AU - Kim, Youngchan
AU - Kim, Junwon
N1 - Publisher Copyright:
© 2021 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2021/11/1
Y1 - 2021/11/1
N2 - In order to deal with dynamic traffic flow, adaptive traffic signal controls using reinforcement learning are being studied. However, most of the related studies are difficult to apply to the real field considering only mathematical optimization. In this study, we propose a reinforcement learning‐based signal optimization model with constraints. The proposed model maintains the sequence of typical signal phases and considers the minimum green time. The model was trained using Simulation of Urban MObility (SUMO), a microscopic traffic simulator. The model was evaluated in the virtual environment similar to a real road with multiple intersections connected. The performance of the proposed model was analyzed by comparing the delay and number of stops with a reinforcement learning model that did not consider constraints and a fixed‐time model. In a peak hour, the proposed model reduced the delay from 3 min 15 s to 2 min 15 s and the number of stops from 11 to 4.7 compared to the fixed‐time model.
AB - In order to deal with dynamic traffic flow, adaptive traffic signal controls using reinforcement learning are being studied. However, most of the related studies are difficult to apply to the real field considering only mathematical optimization. In this study, we propose a reinforcement learning‐based signal optimization model with constraints. The proposed model maintains the sequence of typical signal phases and considers the minimum green time. The model was trained using Simulation of Urban MObility (SUMO), a microscopic traffic simulator. The model was evaluated in the virtual environment similar to a real road with multiple intersections connected. The performance of the proposed model was analyzed by comparing the delay and number of stops with a reinforcement learning model that did not consider constraints and a fixed‐time model. In a peak hour, the proposed model reduced the delay from 3 min 15 s to 2 min 15 s and the number of stops from 11 to 4.7 compared to the fixed‐time model.
KW - Adaptive traffic signal control
KW - Deep Q‐network
KW - Multiple intersections
KW - Reinforcement learning
KW - Traffic signal optimization
UR - http://www.scopus.com/inward/record.url?scp=85119295934&partnerID=8YFLogxK
U2 - 10.3390/app112210688
DO - 10.3390/app112210688
M3 - Article
AN - SCOPUS:85119295934
SN - 2076-3417
VL - 11
JO - Applied Sciences (Switzerland)
JF - Applied Sciences (Switzerland)
IS - 22
M1 - 10688
ER -