Prognostic meta-signature of breast cancer developed by two-stage mixture modeling of microarray data

被引:92
作者
Shen, RL
Ghosh, D [1 ]
Chinnaiyan, AM
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Urol, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Ctr Comprehens Canc, Ann Arbor, MI 48109 USA
关键词
D O I
10.1186/1471-2164-5-94
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: An increasing number of studies have profiled tumor specimens using distinct microarray platforms and analysis techniques. With the accumulating amount of microarray data, one of the most intriguing yet challenging tasks is to develop robust statistical models to integrate the findings. Results: By applying a two-stage Bayesian mixture modeling strategy, we were able to assimilate and analyze four independent microarray studies to derive an inter-study validated "meta-signature" associated with breast cancer prognosis. Combining multiple studies (n = 305 samples) on a common probability scale, we developed a 90-gene meta-signature, which strongly associated with survival in breast cancer patients. Given the set of independent studies using different microarray platforms which included spotted cDNAs, Affymetrix GeneChip, and inkjet oligonucleotides, the individually identified classifiers yielded gene sets predictive of survival in each study cohort. The study-specific gene signatures, however, had minimal overlap with each other, and performed poorly in pairwise cross-validation. The meta-signature, on the other hand, accommodated such heterogeneity and achieved comparable or better prognostic performance when compared with the individual signatures. Further by comparing to a global standardization method, the mixture model based data transformation demonstrated superior properties for data integration and provided solid basis for building classifiers at the second stage. Functional annotation revealed that genes involved in cell cycle and signal transduction activities were over-represented in the meta-signature. Conclusion: The mixture modeling approach unifies disparate gene expression data on a common probability scale allowing for robust, inter-study validated prognostic signatures to be obtained. With the emerging utility of microarrays for cancer prognosis, it will be important to establish paradigms to meta-analyze disparate gene expression data for prognostic signatures of potential clinical use.
引用
收藏
页数:16
相关论文
共 34 条
[1]  
CARTER CL, 1989, CANCER-AM CANCER SOC, V63, P181, DOI 10.1002/1097-0142(19890101)63:1<181::AID-CNCR2820630129>3.0.CO
[2]  
2-H
[3]   Combining multiple microarray studies and modeling interstudy variation [J].
Choi, Jung Kyoon ;
Yu, Ungsik ;
Kim, Sangsoo ;
Yoo, Ook Joon .
BIOINFORMATICS, 2003, 19 :i84-i90
[4]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[5]  
Fioravanti L, 1997, INT J CANCER, V74, P620, DOI 10.1002/(SICI)1097-0215(19971219)74:6<620::AID-IJC11>3.0.CO
[6]  
2-9
[7]  
FISHER B, 1970, SURG GYNECOL OBSTETR, V131, P79
[8]   Diagnostic classification of cancer using DNA microarrays and artificial intelligence [J].
Greer, BT ;
Khan, J .
APPLICATIONS OF BIOINFORMATICS IN CANCER DETECTION, 2004, 1020 :49-66
[9]   Gene expression predictors of breast cancer outcomes [J].
Huang, E ;
Cheng, SH ;
Dressman, H ;
Pittman, J ;
Tsou, MH ;
Horng, CF ;
Bild, A ;
Iversen, ES ;
Liao, M ;
Chen, CM ;
West, M ;
Nevins, JR ;
Huang, AT .
LANCET, 2003, 361 (9369) :1590-1596
[10]   Exploration, normalization, and summaries of high density oligonucleotide array probe level data [J].
Irizarry, RA ;
Hobbs, B ;
Collin, F ;
Beazer-Barclay, YD ;
Antonellis, KJ ;
Scherf, U ;
Speed, TP .
BIOSTATISTICS, 2003, 4 (02) :249-264