A neural substrate of prediction and reward

Cited by: 6039
Authors
Schultz, W
Dayan, P
Montague, PR
Affiliations
[1] BAYLOR COLL MED, DIV NEUROSCI, CTR THEORET NEUROSCI, HOUSTON, TX 77030 USA
[2] UNIV FRIBOURG, INST PHYSIOL, CH-1700 FRIBOURG, SWITZERLAND
[3] MIT, DEPT BRAIN & COGNIT SCI, CTR BIOL & COMPUTAT LEARNING, CAMBRIDGE, MA 02139 USA
DOI
10.1126/science.275.5306.1593
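Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract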
The capacity to predict future events permits a creature to detect, model, and manipulate the causal structure of its interactions with its environment. Behavioral experiments suggest that learning is driven by changes in the expectations about future salient events such as rewards and punishments. Physiological work has recently complemented these studies by identifying dopaminergic neurons in the primate whose fluctuating output apparently signals changes or errors in the predictions of future salient and rewarding events. Taken together, these findings can be understood through quantitative theories of adaptive optimizing control.
Pages: 1593-1599
Page count: 7