MODEL SELECTION AND ACCOUNTING FOR MODEL UNCERTAINTY IN GRAPHICAL MODELS USING OCCAMS WINDOW

被引:720
作者
MADIGAN, D
RAFTERY, AE
机构
关键词
CHORDAL GRAPH; CONTINGENCY TABLE; DECOMPOSABLE LOG-LINEAR MODEL; EXPERT SYSTEM; HYPER-MARKOV DISTRIBUTION; RCURSIVE CAUSAL MODEL;
D O I
10.2307/2291017
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of model selection and accounting for model uncertainty in high-dimensional contingency tables, motivated by expert system applications. The approach most used currently is a stepwise strategy guided by tests based on approximate asymptotic P values leading to the selection of a single model; inference is then conditional on the selected model. The sampling properties of such a strategy are complex, and the failure to take account of model uncertainty leads to underestimation of uncertainty about quantities of interest. In principle, a panacea is provided by the standard Bayesian formalism that averages the posterior distributions of the quantity of interest under each of the models, weighted by their posterior model probabilities. Furthermore, this approach is optimal in the sense of maximizing predictive ability. But this has not been used in practice, because computing the posterior model probabilities is hard and the number of models is very large (often greater than 10(11)). We argue that the standard Bayesian formalism is unsatisfactory and propose an alternative Bayesian approach that, we contend, takes full account of the true model uncertainty by averaging over a much smaller set of models. An efficient search algorithm is developed for finding these models. We consider two classes of graphical models that arise in expert systems: the recursive causal models and the decomposable log-linear models. For each of these, we develop efficient ways of computing exact Bayes factors and hence posterior model probabilities. For the decomposable log-linear models, this is based on properties of chordal graphs and hyper-Markov prior distributions and the resultant calculations can be carried out locally. The end product is an overall strategy for model selection and accounting for model uncertainty that searches efficiently through the very large classes of models involved. Three examples are given. The first two concern data sets that have been analyzed by several authors in the context of model selection. The third addresses a urological diagnostic problem. In each example, our model averaging approach provides better out-of-sample predictive performance than any single model that might reasonably have been selected.
引用
收藏
页码:1535 / 1546
页数:12
相关论文
共 52 条
[1]  
Andersen L. R., 1991, J APPL STAT, V18, P139
[2]  
Berger J. O., 1987, STAT SCI, V2, P317
[3]  
BERGER JO, 1987, J AM STAT ASSOC, V82, P112, DOI 10.2307/2289131
[4]  
Bishop Y., 1975, DISCRETE MULTIVARIAT
[5]   SOME PROPERTIES OF THE DIRICHLET-MULTINOMIAL DISTRIBUTION AND ITS USE IN PRIOR ELICITATION [J].
CHALONER, K ;
DUNCAN, GT .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1987, 16 (02) :511-523
[6]  
COOPER GF, 1992, MACH LEARN, V9, P309, DOI 10.1007/BF00994110
[7]  
Critchlow Douglas E., 1985, METRIC METHODS ANAL, V1
[8]   STATISTICAL-THEORY - THE PREQUENTIAL APPROACH [J].
DAWID, AP .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1984, 147 :278-292
[9]   HYPER MARKOV LAWS IN THE STATISTICAL-ANALYSIS OF DECOMPOSABLE GRAPHICAL MODELS [J].
DAWID, AP ;
LAURITZEN, SL .
ANNALS OF STATISTICS, 1993, 21 (03) :1272-1317
[10]  
Dawid AP., 1986, ENCY STAT SCI, P210, DOI DOI 10.1002/0471667196.ESS2064.PUB2