Regularized estimation of large covariance matrices

被引:836
作者
Bickel, Peter J. [1 ]
Levina, Elizaveta [2 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
关键词
covariance matrix; regularization; banding; Cholesky decomposition;
D O I
10.1214/009053607000000758
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper considers estimating a covariance matrix of p variables from n observations by either banding or tapering the sample covariance matrix, or estimating a banded version of the inverse of the covariance. We show that these estimates are consistent in the operator norm as long as (log p)/n -> 0, and obtain explicit rates. The results are uniform over some fairly natural well-conditioned families of covariance matrices. We also introduce an analogue of the Gaussian white noise model and show that if the population covariance is embeddable in that model and well-conditioned, then the banded approximations produce consistent estimates of the eigenvalues and associated eigenvectors of the covariance matrix. The results can be extended to smooth versions of banding and to non-Gaussian distributions with sufficiently short tails. A resampling approach is proposed for choosing the banding parameter in practice. This approach is illustrated numerically on both simulated and real data.
引用
收藏
页码:199 / 227
页数:29
相关论文
共 35 条
[1]  
Anderson TW., 1958, An introduction to multivariate statistical analysis
[2]  
[Anonymous], 1946, COMMENT MATH HELV
[3]   LIMIT OF THE SMALLEST EIGENVALUE OF A LARGE DIMENSIONAL SAMPLE COVARIANCE-MATRIX [J].
BAI, ZD ;
YIN, YQ .
ANNALS OF PROBABILITY, 1993, 21 (03) :1275-1294
[4]  
BARDET JM, 2002, THEORY APPL LONG RAN, P579
[5]  
Bickel PJ, 2006, J MACH LEARN RES, V7, P705
[6]   Some theory for Fisher's linear discriminant function, 'naive Bayes', and some alternatives when there are many more variables than observations [J].
Bickel, PJ ;
Levina, E .
BERNOULLI, 2004, 10 (06) :989-1010
[7]  
Bottcher A., 1996, Fields Inst. Monogr., V3, P1
[8]  
DASPREMONT A, 2007, IN PRESS SIAM J MATR
[9]  
DeVore Ronald A., 1993, CONSTRUTIVE APPROXIM, V303
[10]  
DONOHO DL, 1995, J ROY STAT SOC B MET, V57, P301