PDP: Parallel Dynamic Programming

Cited by: 38

Authors:

Fei-Yue Wang [1,2,3,4]

Jie Zhang [1,5,6]

Qinglai Wei [1,7,3]

Xinhu Zheng [1,8]

Li Li [1,9]

Affiliations:

[1] IEEE

[2] State Key Laboratory of Management and Control for Complex Systems (SKL-MCCS), Institute of Automation, Chinese Academy of Sciences (CASIA)

[3] School of Computer and Control Engineering, University of Chinese Academy of Sciences

[4] Research Center for Military Computational Experiments and Parallel Systems Technology, National University of Defense Technology

[5] State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences (SKL-MCCS, CASIA)

[6] Qingdao Academy of Intelligent Industries

[7] State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences (SKL-MCCS, CASIA)

[8] Department of Computer Science and Engineering, University of Minnesota

[9] Department of Automation, Tsinghua University

Source:

IEEE/CAA Journal of Automatica Sinica | 2017 / Vol. 4 / No. 01

Keywords:

Parallel dynamic programming; Dynamic programming; Adaptive dynamic programming; Reinforcement learning; Deep learning; Neural networks; Artificial intelligence

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Deep reinforcement learning is a focal research area in artificial intelligence. The principle of optimality in dynamic programming is key to the success of reinforcement learning methods. The principle of adaptive dynamic programming (ADP) is first presented in place of direct dynamic programming (DP), and the inherent relationship between ADP and deep reinforcement learning is developed. Next, analytics intelligence, a necessary requirement for real reinforcement learning, is discussed. Finally, the principle of parallel dynamic programming, which integrates dynamic programming and analytics intelligence, is presented as the future of computational intelligence.

Citation

Pages: 1 / 5

Page count: 5

References: 6 in total

[1] Control 5.0: From Newton to Merton in Popper's Cyber-Social-Physical Spaces [J].

Fei-Yue Wang .

IEEE/CAA Journal of Automatica Sinica, 2016, (03) :233-234

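[2] Research status and prospects of deep learning in the control field [J].

Duan Yanjie; Lv Yisheng; Zhang Jie; Zhao Xueliang; Wang Fei-Yue.

Acta Automatica Sinica, 2016, 42 (05) :643-654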

[3] Parallel control: a data-driven computational control method [J].

Wang Fei-Yue.

Acta Automatica Sinica, 2013, 39 (04) :293-302

[4] Implementing Adaptive Fuzzy Logic Controllers with Neural Networks: A Design Paradigm [J].

Fei-Yue Wang, Hung-man Kim.

Journal of Intelligent and Fuzzy Systems, 1995, (2)

[5] Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems [J].

Wei Q, Liu D, Yang X.

IEEE Transactions on Neural Networks and Learning Systems, 2015

[6] Predictive analytics white paper.

C. Nyce.

2007
