共 46 条
[1]
[Anonymous], 1994, ON LINE Q LEARNING U
[2]
[Anonymous], 1982, GAME THEORY
[3]
[Anonymous], NUCCS9311
[4]
[Anonymous], PROC ICML
[5]
Barto A.G., 1989, 8995 U MASS DEP COMP
[7]
Benveniste A, 1990, Adaptive algorithms and stochastic approximations
[8]
Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
[9]
Bertsekas Dimitri P., 1989, PARALLEL DISTRIBUTED