Towards multi-agent reinforcement learning for integrated network of optimal traffic controllers (MARLIN-OTC)

Cited by: 32

Authors:

El-Tantawy, Samah [1]

Abdulhai, Baher [2]

Affiliations:

[1] Univ Toronto, Dept Civil Engn, Toronto ITS Ctr & Testbed, Toronto, ON M5S 1A4, Canada

[2] Univ Toronto, Dept Civil Engn, Intelligent Transportat Syst Ctr & Testbed, Toronto, ON M5S 1A4, Canada

Source:

TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH | 2010 / Vol. 2 / Issue 02

Keywords:

Traffic Control; Reinforcement Learning; Game Theory; Multi-Agent Reinforcement Learning; Coordination

DOI:

10.3328/TL.2010.02.02.89-110

Chinese Library Classification:

U [Transportation]

Discipline classification codes:

08; 0823

Abstract:

Traffic congestion can be alleviated by infrastructure expansions; however, improving the existing infrastructure using traffic control is more plausible given the obvious constraints on financial resources and physical space. The most promising control tools include ramp metering, variable message signs, and signalized intersections. Synergizing these strategies in one platform is an ultimate and challenging goal for alleviating traffic gridlock and optimally utilizing the existing system capacity; this is referred to as Integrated Traffic Control (ITC). Reinforcement Learning (RL) techniques have the potential to tackle the optimal traffic control problem. Game Theory (GT) is well suited to modelling distributed control systems as multiplayer games. Multi-Agent Reinforcement Learning (MARL) achieves the potential synergy of RL and GT concepts, providing a promising tool for optimal distributed traffic control. The objective of this paper is to clarify the opportunities that game theory concepts and MARL approaches offer for creating an adaptive optimal traffic control system that is decentralized yet integrated through agents' interactions. In this paper, we comparatively review and evaluate the relevant existing approaches. We then envision and introduce a novel framework that combines GT concepts and MARL to achieve a Multi-Agent Reinforcement Learning for Integrated Network of Optimal Traffic Controllers (MARLIN-OTC).


Pages: 89-110

Page count: 22
