TEMPORAL DIFFERENCE LEARNING AND TD-GAMMON

被引:910
作者
TESAURO, G
机构
关键词
D O I
10.1145/203330.203343
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
[No abstract available]
引用
收藏
页码:58 / 68
页数:11
相关论文
共 16 条
[1]  
[Anonymous], 1990, ADV NEURAL INF PROCE
[2]   COMPUTER BACKGAMMON [J].
BERLINER, H .
SCIENTIFIC AMERICAN, 1980, 242 (06) :64-&
[3]  
Dayan P, 1994, ADV NEURAL INFORM PR, P817
[4]   TOWARD AN IDEAL TRAINER [J].
EPSTEIN, SL .
MACHINE LEARNING, 1994, 15 (03) :251-277
[5]  
FAWCETT TE, 1992, MACHINE LEARNING /, P144
[6]   MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS [J].
HORNIK, K ;
STINCHCOMBE, M ;
WHITE, H .
NEURAL NETWORKS, 1989, 2 (05) :359-366
[7]  
ISABELLE JF, 1993, THESIS U MONTREAL
[8]  
MAGRIEL P, 1976, BACKGAMMON
[9]  
Robertie B., 1992, INSIDE BACKGAMMON, V2, P14
[10]  
Rumelhart DE, 1986, ENCY DATABASE SYST, P45