VARIABLE SELECTION VIA GIBBS SAMPLING

被引:1680
作者
GEORGE, EI [1 ]
MCCULLOCH, RE [1 ]
机构
[1] UNIV CHICAGO,GRAD SCH BUSINESS,CHICAGO,IL 60637
关键词
DATA AUGMENTATION; HIERARCHICAL BAYES; LATENT VARIABLES; MIXTURE; MULTIPLE REGRESSION;
D O I
10.2307/2290777
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A crucial problem in building a multiple regression model is the selection of predictors to include. The main thrust of this article is to propose and develop a procedure that uses probabilistic considerations for selecting promising subsets. This procedure entails embedding the regression setup in a hierarchical normal mixture model where latent variables are used to identify subset choices. In this framework the promising subsets of predictors can be identified as those with higher posterior probability. The computational burden is then alleviated by using the Gibbs sampler to indirectly sample from this multinomial posterior distribution on the set of possible subset choices. Those subsets with higher probability-the promising ones-can then be identified by their more frequent appearance in the Gibbs sample.
引用
收藏
页码:881 / 889
页数:9
相关论文
共 21 条
[11]  
MORRIS CN, 1987, J AM STAT ASSOC, V82, P542, DOI 10.2307/2289459
[12]  
PERICCHI LR, 1984, BIOMETRIKA, V71, P575
[13]  
POIRIER DJ, 1985, BAYESIAN STATISTICS, P711
[14]  
SMITH AFM, 1980, J ROY STAT SOC B MET, V42, P213
[15]   EFFECTS OF COLLINEARITY ON INFORMATION ABOUT REGRESSION-COEFFICIENTS [J].
SOOFI, ES .
JOURNAL OF ECONOMETRICS, 1990, 43 (03) :255-274
[16]  
SPIEGELHALTER DJ, 1982, J ROY STAT SOC B MET, V44, P377
[17]  
STEWART L, 1987, STATISTICIAN, V36, P211
[18]   THE CALCULATION OF POSTERIOR DISTRIBUTIONS BY DATA AUGMENTATION [J].
TANNER, MA ;
WING, HW .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (398) :528-540
[19]  
Tukey JW, 1977, EPLORATORY DATA ANAL
[20]  
Zellner A., 1986, BAYESIAN INFERENCE D, V6, P233, DOI DOI 10.4236/OJS.2013.34031