14 entries in total
[4]
Planning and acting in partially observable stochastic domains[J]. Leslie Pack Kaelbling, Michael L. Littman, Anthony R. Cassandra. Artificial Intelligence. 1998 (1)
[5]
Elevator Group Control Using Multiple Reinforcement Learning Agents[J]. Robert H. Crites, Andrew G. Barto. Machine Learning. 1998 (2)
[7]
Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching[J]. Long-Ji Lin. Machine Learning. 1992 (3)
[8]
Q-learning[J]. Christopher J. C. H. Watkins, Peter Dayan. Machine Learning. 1992 (3)
[9]
A situated-automata approach to the design of embedded agents[J]. Leslie Pack Kaelbling. ACM SIGART Bulletin. 1991 (4)