SMOOTHING CATEGORICAL-DATA

被引:33
作者
SIMONOFF, JS [1 ]
机构
[1] NYU,STERN SCH BUSINESS,DEPT STAT & OPERAT RES,NEW YORK,NY 10012
关键词
BAYES METHODS; KERNEL ESTIMATION; PENALIZED LIKELIHOOD; SHRINKAGE ESTIMATION;
D O I
10.1016/0378-3758(94)00121-B
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Statistical analysis of categorical data (contingency tables) has a long history, and a good deal of work has been done formulating parametric models for such data. Unfortunately, such analyses are often not appropriate, due to sparseness of the table. An alternative to these parametric models is smoothing the table, by 'borrowing' information from neighboring cells. In this paper, various strategies that have been proposed for such smoothing are discussed. It is shown that these strategies have close ties to other areas of statistical methodology, including shrinkage estimation, Bayes methods, penalized likelihood, spline estimation, and kernel density and regression estimation. Probability estimates based on smoothing methods can outperform the unsmoothed frequency estimates when the table is sparse (often, dramatically so). Methods for one-dimensional tables are discussed, as well as generalizations to higher-dimensional tables. Attempts to use smoothed probability estimates in statistical functionals are identified. Finally, potential future work in categorical data smoothing is also mentioned.
引用
收藏
页码:41 / 69
页数:29
相关论文
共 116 条
[1]  
Agresti A., 1984, ANAL ORDINAL CATEGOR
[2]  
Agresti A., 1990, CATEGORICAL DATA ANA
[3]   MULTIVARIATE BINARY DISCRIMINATION BY KERNEL METHOD [J].
AITCHISON, J ;
AITKEN, CGG .
BIOMETRIKA, 1976, 63 (03) :413-420
[5]  
AKAIKE H, 1974, IEEE AC, V19, P715
[6]   PSEUDO-BAYES ESTIMATION OF MULTINOMIAL PROPORTIONS [J].
ALBERT, J .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1981, 10 (16) :1587-1611
[7]   BAYESIAN-ESTIMATION METHODS FOR 2X2 CONTINGENCY-TABLES USING MIXTURES OF DIRICHLET DISTRIBUTIONS [J].
ALBERT, JH ;
GUPTA, AK .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (383) :708-717
[8]   EMPIRICAL BAYES ESTIMATION IN CONTINGENCY-TABLES [J].
ALBERT, JH .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1987, 16 (08) :2459-2485
[9]  
ALBERT JH, 1983, J ROY STAT SOC B MET, V45, P60
[10]   MIXTURES OF DIRICHLET DISTRIBUTIONS AND ESTIMATION IN CONTINGENCY-TABLES [J].
ALBERT, JH ;
GUPTA, AK .
ANNALS OF STATISTICS, 1982, 10 (04) :1261-1268