A two-state partially observable Markov decision process with uniformly distributed observations

被引:19
作者
GrosfeldNir, A [1 ]
机构
[1] NORTHWESTERN UNIV,EVANSTON,IL 60208
关键词
D O I
10.1287/opre.44.3.458
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
A controller observes a production system periodically, over time. If the system is in the GOOD state during one period, there is a constant probability that it will deteriorate and be in the BAD state during the next period (and remains there). The true state of the system is unobservable and on only be inferred from observations (quality of output). Two actions are available: CONTINUE or REPLACE (for a fixed cost). The objective is to maximize the expected discounted value of the total future income. For both the finite- and infinite-horizon problems, the optimal policy is of a CONTROL LIMIT (CLT) type: continue if the good state probability exceeds the CLT, and replace otherwise. The computation of the CLT involves a functional equation. An analytical solution for this equation is as yet unknown. For uniformly distributed observations we obtain the infinite-horizon CLT analytically. We also show that the finite horizon CLTs, as a function of the time remaining, are not necessarily monotone, which is counterintuitive.
引用
收藏
页码:458 / 463
页数:6
相关论文
共 14 条
[1]   STRUCTURAL RESULTS FOR PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES [J].
ALBRIGHT, SC .
OPERATIONS RESEARCH, 1979, 27 (05) :1041-1053
[2]  
[Anonymous], 1971, THESIS I OPERATIONS
[3]   OPTIMAL CONTROL OF PARTIALLY OBSERVABLE MARKOVIAN SYSTEMS [J].
AOKI, M .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1965, 280 (05) :367-&
[5]  
Bertsekas D. P, 1976, DYNAMIC PROGRAMMING
[6]   THE OPTIMAL SEARCH FOR A MOVING TARGET WHEN THE SEARCH PATH IS CONSTRAINED [J].
EAGLE, JN .
OPERATIONS RESEARCH, 1984, 32 (05) :1107-1115
[7]  
LANE DE, 1989, OPER RES, V37, P2
[8]   ORDERED SOLUTIONS FOR DYNAMIC PROGRAMS [J].
LOVEJOY, WS .
MATHEMATICS OF OPERATIONS RESEARCH, 1987, 12 (02) :269-276
[9]  
Monahan G.E., 1982, MANAGE SCI, V18, P362
[10]  
NIR AG, 1986, THESIS NW U