Reinforcement learning for True Adaptive traffic signal control

被引:367
作者
Abdulhai, B
Pringle, R
Karakoulas, GJ
机构
[1] Univ Toronto, Dept Civil Engn, Intelligent Transportat Syst Ctr, Toronto, ON M5S 1A4, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 1A4, Canada
关键词
traffic signal controllers; intelligent transportation systems; traffic control; traffic management; adaptive systems;
D O I
10.1061/(ASCE)0733-947X(2003)129:3(278)
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The ability to exert real-time, adaptive control of transportation processes is the core of many intelligent transportation systems decision support tools. Reinforcement learning, an artificial intelligence approach undergoing development in the machine-learning community, offers key advantages in this regard. The ability of a control agent to learn relationships between control actions and their effect on the environment while pursuing a goal is a distinct improvement over prespecified models of the environment. Prespecified models are a prerequisite of conventional control methods and their accuracy limits the performance of control agents. This paper contains an introduction to Q-learning, a simple yet powerful reinforcement learning algorithm, and presents a case study involving application to traffic signal control. Encouraging results of the application to an isolated traffic signal, particularly under variable traffic conditions, are presented. A broader research effort is outlined, including extension to linear and networked signal systems and integration with dynamic route guidance. The research objective involves optimal control of heavily congested traffic across a two-dimensional road network-a challenging task for conventional traffic signal control methodologies.
引用
收藏
页码:278 / 285
页数:8
相关论文
共 20 条
[11]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285
[12]  
Sadek AW, 1998, TRANSPORT RES REC, P53
[13]   Controlled optimization of phases at an intersection [J].
Sen, S ;
Head, KL .
TRANSPORTATION SCIENCE, 1997, 31 (01) :5-17
[14]  
SMITH RL, 1998, THESIS U AUCKLAND AU
[15]   Traffic-responsive signal timing for system-wide traffic control [J].
Spall, JC ;
Chin, DC .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 1997, 5 (3-4) :153-163
[16]  
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[17]  
Thorpe T.L., 1997, Vehicle traffic light control using SARSA
[18]  
Watkins C. J. C. H., 1989, THESIS U CAMBRIDGE C
[19]  
WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698
[20]  
Yagar S., 1996, TRANSPORT RES REC, V1554, P1