Reinforcement learning for true adaptive traffic signal control

Cited by: 367

Authors:

Abdulhai, B

Pringle, R

Karakoulas, GJ

Affiliations:

[1] Univ Toronto, Dept Civil Engn, Intelligent Transportat Syst Ctr, Toronto, ON M5S 1A4, Canada

[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 1A4, Canada

Source:

JOURNAL OF TRANSPORTATION ENGINEERING | 2003 / Vol. 129 / No. 03

Keywords:

traffic signal controllers; intelligent transportation systems; traffic control; traffic management; adaptive systems;

DOI:

10.1061/(ASCE)0733-947X(2003)129:3(278)

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

The ability to exert real-time, adaptive control of transportation processes is the core of many intelligent transportation systems decision support tools. Reinforcement learning, an artificial intelligence approach undergoing development in the machine-learning community, offers key advantages in this regard. The ability of a control agent to learn relationships between control actions and their effect on the environment while pursuing a goal is a distinct improvement over prespecified models of the environment. Prespecified models are a prerequisite of conventional control methods and their accuracy limits the performance of control agents. This paper contains an introduction to Q-learning, a simple yet powerful reinforcement learning algorithm, and presents a case study involving application to traffic signal control. Encouraging results of the application to an isolated traffic signal, particularly under variable traffic conditions, are presented. A broader research effort is outlined, including extension to linear and networked signal systems and integration with dynamic route guidance. The research objective involves optimal control of heavily congested traffic across a two-dimensional road network, a challenging task for conventional traffic signal control methodologies.
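
To make the Q-learning idea named in the abstract concrete, here is a minimal sketch of the standard tabular Q-learning update. It is not the paper's implementation: the state labels, the action names, the reward value, and the `alpha` and `gamma` settings are all hypothetical choices made for illustration only.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step.

    Moves Q(state, action) toward reward + gamma * max over a' of Q(next_state, a').
    alpha (learning rate) and gamma (discount factor) are illustrative defaults.
    """
    best_next = max(q[(next_state, a)] for a in actions)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical toy setup (an assumption, not taken from the paper):
# states are coarse queue levels, actions keep or switch the green phase,
# and the reward is a negative delay so that less delay is better.
ACTIONS = ("keep", "switch")
q_table = defaultdict(float)  # unseen (state, action) pairs start at 0.0

new_value = q_update(
    q_table,
    state="long_queue",
    action="switch",
    reward=-4.0,
    next_state="short_queue",
    actions=ACTIONS,
)
print(new_value)
```

The agent needs no prespecified traffic model here: the table is adjusted only from observed transitions and rewards, which is the property the abstract contrasts with conventional control methods.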


Pages: 278-285

Page count: 8

