Integrative analysis of multiple cancer prognosis studies with gene expression measurements

被引:42
作者
Ma, Shuangge [1 ]
Huang, Jian [2 ]
Wei, Fengrong [3 ]
Xie, Yang [4 ]
Fang, Kuangnan [5 ]
机构
[1] Yale Univ, Sch Publ Hlth, New Haven, CT 06520 USA
[2] Univ Iowa, Dept Stat & Actuarial Sci, Iowa, IA USA
[3] Univ W Georgia, Dept Math, Carrollton, GA USA
[4] Univ Texas SW Med Ctr Dallas, Dept Clin Sci, Dallas, TX 75390 USA
[5] Xiamen Univ, Dept Stat, Xiamen, Fujian, Peoples R China
关键词
integrative analysis; cancer prognosis; microarray; penalized selection; FAILURE TIME MODEL; MICROARRAY DATA; BREAST; METAANALYSIS; REGRESSION; SELECTION; SURVIVAL;
D O I
10.1002/sim.4337
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Although in cancer research microarray gene profiling studies have been successful in identifying genetic variants predisposing to the development and progression of cancer, the identified markers from analysis of single datasets often suffer low reproducibility. Among multiple possible causes, the most important one is the small sample size hence the lack of power of single studies. Integrative analysis jointly considers multiple heterogeneous studies, has a significantly larger sample size, and can improve reproducibility. In this article, we focus on cancer prognosis studies, where the response variables are progression- free, overall, or other types of survival. A group minimax concave penalty (GMCP) penalized integrative analysis approach is proposed for analyzing multiple heterogeneous cancer prognosis studies with microarray gene expression measurements. An efficient group coordinate descent algorithm is developed. The GMCP can automatically accommodate the heterogeneity across multiple datasets, and the identified markers have consistent effects across multiple studies. Simulation studies show that the GMCP provides significantly improved selection results as compared with the existing meta- analysis approaches, intensity approaches, and group Lasso penalized integrative analysis. We apply the GMCP to four microarray studies and identify genes associated with the prognosis of breast cancer. Copyright (C) 2011 JohnWiley & Sons, Ltd. 3361
引用
收藏
页码:3361 / 3371
页数:11
相关论文
共 31 条
[1]   Novel Breast Cancer Risk Alleles and Interaction with Ionizing Radiation among US Radiologic Technologists [J].
Bhatti, Parveen ;
Doody, Michele M. ;
Rajaraman, Preetha ;
Alexander, Bruce H. ;
Yeager, Meredith ;
Hutchinson, Amy ;
Burdette, Laurie ;
Thomas, Gilles ;
Hunter, David J. ;
Simon, Steven L. ;
Weinstock, Robert M. ;
Rosenstein, Marvin ;
Stovall, Marilyn ;
Preston, Dale L. ;
Linet, Martha S. ;
Hoover, Robert N. ;
Chanock, Stephen J. ;
Sigurdson, Alice J. .
RADIATION RESEARCH, 2010, 173 (02) :214-224
[2]   COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION [J].
Breheny, Patrick ;
Huang, Jian .
ANNALS OF APPLIED STATISTICS, 2011, 5 (01) :232-253
[3]  
BUCKLEY J, 1979, BIOMETRIKA, V66, P429
[4]   Gene expression profiling of breast cancer [J].
Cheang, Maggie C. U. ;
van de Rijn, Matt ;
Nielsen, Torsten O. .
ANNUAL REVIEW OF PATHOLOGY-MECHANISMS OF DISEASE, 2008, 3 :67-97
[5]   A latent variable approach for meta-analysis of gene expression data from multiple microarray experiments [J].
Choi, Hyungwon ;
Shen, Ronglai ;
Chinnaiyan, Arul M. ;
Ghosh, Debashis .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]   Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO [J].
Datta, Susmita ;
Le-Rademacher, Jennifer ;
Datta, Somnath .
BIOMETRICS, 2007, 63 (01) :259-271
[7]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[8]   Meta-analysis of microarray data on pancreatic cancer defines a set of commonly dysregulated genes [J].
Grützmann, R ;
Boriss, H ;
Ammerpohl, O ;
Lüttges, J ;
Kalthoff, H ;
Schackert, HK ;
Klöppel, G ;
Saeger, HD ;
Pilarsky, C .
ONCOGENE, 2005, 24 (32) :5079-5088
[9]  
Hein S, 2010, BREAST CANC IN PRESS
[10]   Gene expression predictors of breast cancer outcomes [J].
Huang, E ;
Cheng, SH ;
Dressman, H ;
Pittman, J ;
Tsou, MH ;
Horng, CF ;
Bild, A ;
Iversen, ES ;
Liao, M ;
Chen, CM ;
West, M ;
Nevins, JR ;
Huang, AT .
LANCET, 2003, 361 (9369) :1590-1596