Missing-data methods for generalized linear models: A comparative review

Cited by: 318
Authors
Ibrahim, JG [1]
Chen, MH
Lipsitz, SR
Herring, AH
Affiliations
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[2] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
[3] Med Univ S Carolina, Dept Biometry & Epidemiol, Charleston, SC 29425 USA
Funding
National Institutes of Health (USA)
Keywords
EM algorithm; generalized linear model; Gibbs sampling; maximum likelihood; missing at random; multiple imputation; nonignorable missing data; posterior distribution; weighted estimating equation;
DOI
10.1198/016214504000001844
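Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code
020208 ; 070103 ; 0714 ;
Abstract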
Missing data is a major issue in many applied problems, especially in the biomedical sciences. We review four common approaches for inference in generalized linear models (GLMs) with missing covariate data: maximum likelihood (ML), multiple imputation (MI), fully Bayesian (FB), and weighted estimating equations (WEEs). There is considerable interest in how these four methodologies are related, the properties of each approach, the advantages and disadvantages of each methodology, and computational implementation. We examine data that are missing at random and nonignorable missing. For ML we focus on techniques using the EM algorithm, and in particular, discuss the EM by the method of weights and related procedures as discussed by Ibrahim. For MI, we examine the techniques developed by Rubin. For FB, we review approaches considered by Ibrahim et al. For WEE, we focus on the techniques developed by Robins et al. We use a real dataset and a detailed simulation study to compare the four methods.
Pages: 332-346
Page count: 15
Related papers
89 in total
[1]  
[Anonymous], 1994, LOGISTIC REGRESSION
[2]  
[Anonymous], 1999, 99054 TNOVGZPG
[3]  
[Anonymous], 1997, EM ALGORITHM EXTENSI
[4]   REGRESSION-ANALYSIS FOR CATEGORICAL VARIABLES WITH OUTCOME SUBJECT TO NONIGNORABLE NONRESPONSE [J].
BAKER, SG ;
LAIRD, NM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (401) :62-69
[5]  
Ibrahim J. G., 1992, AUST J STAT, V34, P461, DOI 10.1111/J.1467-842X.1992.TB01062.X
[6]   Nonparametric and semiparametric models for missing covariates in parametric regression [J].
Chen, HY .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (468) :1176-1189
[7]   Proportional hazards regression with missing covariates [J].
Chen, HY ;
Little, RJA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (447) :896-908
[8]   Double-semiparametric method for missing covariates in Cox regression models [J].
Chen, HY .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (458) :565-576
[9]   Monte Carlo estimation of Bayesian credible and HPD intervals [J].
Chen, MH ;
Shao, QM .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1999, 8 (01) :69-92
[10]   Maximum likelihood methods for cure rate models with missing covariates [J].
Chen, MH ;
Ibrahim, JG .
BIOMETRICS, 2001, 57 (01) :43-52