Stochastic algorithms, symmetric Markov perfect equilibrium, and the 'curse' of dimensionality

被引：85

作者：

Pakes, A ^{[1
]}

McGuire, P

机构：

[1] Harvard Univ, Dept Econ, Littauer Ctr 117, Cambridge, MA 02138 USA

[2] Yale Univ, Econ Growth Ctr, New Haven, CT 06520 USA

来源：

ECONOMETRICA | 2001年 / 69卷 / 05期

关键词：

dynamic games; stochastic algorithms; curse of dimensionality;

D O I：

10.1111/1468-0262.00241

中图分类号：

F [经济];

学科分类号：

02 ;

摘要：

This paper introduces a stochastic algorithm for computing symmetric Markov perfect equilibria. The algorithm computes equilibrium policy and value functions, and generates a transition kernel for the (stochastic) evolution of the state of the system, It has two features that together imply that it need not be subject to the curse of dimensionality. First, the integral that determines continuation values is never calculated; rather it is approximated by a simple average of returns from past outcomes of the algorithm, an approximation whose computational burden is not tied to the dimension of the state space. Second, iterations of the algorithm update value and policy functions at a single (rather than at all possible) points in the state space. Random draws from a distribution set by the updated policies determine the location of the next iteration's updates. This selection only repeatedly hits the recurrent class of points, a subset whose cardinality is not directly tied to that of the state space. Numerical results for industrial organization problems show that our algorithm can increase speed and decrease memory requirements by several orders of magnitude.

引用

页码：1261 / 1281

页数：21

共 21 条

[1]

[Anonymous], 1983, MARKOV CHAINS

[2]

[Anonymous], 8024 NBER

[3] LEARNING TO ACT USING REAL-TIME DYNAMIC-PROGRAMMING [J].

BARTO, AG ;

BRADTKE, SJ ;

SINGH, SP .

ARTIFICIAL INTELLIGENCE, 1995, 72 (1-2) :81-138

[4]

Bertsekas D., 1996, NEURO DYNAMIC PROGRA, V1st

[5]

Bertsekas D.P., 2005, DYNAMIC PROGRAMMING, V1

[6]

Bertsekas DP, 1995, Dynamic Programming and Optimal Control, V2

[7] MULTIDIMENSIONAL STOCHASTIC APPROXIMATION METHODS [J].

BLUM, JR .

ANNALS OF MATHEMATICAL STATISTICS, 1954, 25 (04) :737-744

[8] MARKOV-PERFECT INDUSTRY DYNAMICS - A FRAMEWORK FOR EMPIRICAL WORK [J].

ERICSON, R ;

PAKES, A .

REVIEW OF ECONOMIC STUDIES, 1995, 62 (01) :53-82

[9]

FUDENBERG D, 1999, LEARNING GAMES

[10] Efficient representation of state spaces for some dynamic models [J].

Gowrisankaran, G .

JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 1999, 23 (08) :1077-1098

← 1 2 3 →