Towards multi-agent reinforcement learning for integrated network of optimal traffic controllers (MARLIN-OTC)

被引:32
作者
El-Tantawy, Samah [1 ]
Abdulhai, Baher [2 ]
机构
[1] Univ Toronto, Dept Civil Engn, Toronto ITS Ctr & Testbed, Toronto, ON M5S 1A4, Canada
[2] Univ Toronto, Dept Civil Engn, Intelligent Transportat Syst Ctr & Testbed, Toronto, ON M5S 1A4, Canada
来源
TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH | 2010年 / 2卷 / 02期
关键词
Traffic Control; Reinforcement Learning; Game Theory; Multi-Agent Reinforcement Learning; COORDINATION;
D O I
10.3328/TL.2010.02.02.89-110
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Traffic congestion can be alleviated by infrastructure expansions; however, improving the existing infrastructure using traffic control is more plausible due to the obvious financial resources and physical space constraints. The most promising control tools include ramp metering, variable message signs, and signalized intersections. Synergizing the aforementioned strategies in one platform is an ultimate and challenging goal to alleviate traffic gridlock and optimally utilize the existing system capacity; this is referred to as Integrated Traffic Control (ITC). Reinforcement Learning (RL) techniques have the potential to tackle the optimal traffic control problem. Game Theory (GT) fits well in modelling the distributed control systems as multiplayer games. Multi-Agent Reinforcement Learning (MARL) achieves the potential synergy of RL and GT concepts, providing a promising tool for optimal distributed traffic control. The objective of this paper is to clarify the opportunities of game theory concepts and MARL approaches in creating an adaptive optimal traffic control system that is decentralized but yet integrated through agents' interactions. In this paper, we comparatively review and evaluate the relevant existing approaches. We then envision and introduce a novel framework that combines GT concepts and MARL to achieve a Multi-Agent Reinforcement Learning for Integrated Network of Optimal Traffic Controllers (MARLIN-OTC).
引用
收藏
页码:89 / 110
页数:22
相关论文
共 65 条
[51]  
Richter Silvia., 2007, Advances in Neural Information Processing Systems, V19
[52]  
Salkham As'ad, 2008, 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, P560, DOI 10.1109/WIIAT.2008.88
[53]  
Shoham Y., 2003, Multi-agent reinforcement learning: a critical survey
[54]   THE SYDNEY COORDINATED ADAPTIVE TRAFFIC (SCAT) SYSTEM PHILOSOPHY AND BENEFITS [J].
SIMS, AG ;
DOBINSON, KW .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 1980, 29 (02) :130-137
[55]  
Steingrover M.S. R. P. S. N. E. B. B., 2005, Proceedings of the 17th Belgium-Netherlands Conference on Artificial Intelligence, VOctober 2005, P216
[56]  
Sutton R.S., 2017, Introduction to reinforcement learning
[57]  
Thorpe T., 1997, VEHICLE TRAFFIC LIGH
[58]  
van Katwijk RT, 2005, WHITESTEIN SER SOFTW, P113
[59]  
Wang X., 2002, ADV NEURAL INFORM PR, V15, P1603
[60]  
Wang Y., 2007, MODELING INFORM CONT, P281