Cycling in a stochastic learning algorithm for normal form games

被引:29
作者
Posch, M
机构
[1] Institut fur Medizinische Statistik, Universität Wien, A-1090 Vienna
关键词
evolutionary games; learning; bounded rationality; learning algorithms;
D O I
10.1007/s001910050041
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper we study a stochastic learning model for 2 x 2 normal form games that are played repeatedly. The main emphasis is put on the emergence of cycles. We assume that the players have neither information about the payoff matrix of their opponent nor about their own. At every round each player can only observe his or her action and the payoff he or she receives. We prove that the learning algorithm, which is modeled by an urn scheme proposed by Arthur (1993), leads with positive probability to a cycling of strategy profiles if the game has a mixed Nash equilibrium. In case there are strict Nash equilibria, the learning process converges a.s. to the set of Nash equilibria.
引用
收藏
页码:193 / 207
页数:15
相关论文
共 23 条
[1]  
[Anonymous], [No title captured], DOI DOI 10.1007/BF01199986
[2]  
Arthur W.B., 1988, WP8886 IIASA
[3]  
ARTHUR WB, 1984, P C STOCH OPT KIEV 1
[4]  
ARTHUR WB, 1983, WP8380 IIASA
[5]  
BENAIM M, 1995, J ERGODIC THEORY DYN, V15, P1005
[6]  
BENAIM M, 1994, UNPUB LEARNING PROCE
[7]  
Benveniste A, 1990, Adaptive algorithms and stochastic approximations
[8]  
BORGERS T, 1993, UNPUB LEARNING REINF
[9]  
BUSH RR, 1955, STOCHASTIC MODELS LE
[10]  
EICHBERGER J, 1991, J ECON BEHAV ORGAN, V22, P69