A comprehensive approach to the analysis of matrix-assisted laser desorption/ionization-time of flight proteomics spectra from serum samples

Cited by: 113
Authors
Baggerly, KA [1]
Morris, JS [1]
Wang, J [1]
Gold, D [1]
Xiao, LC [1]
Coombes, KR [1]
Affiliation
[1] Univ Texas, MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
Keywords
cross validation; data cleaning; discrimination; genetic algorithm; Mahalanobis distance
DOI
10.1002/pmic.200300522
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
For our analysis of the data from the First Annual Proteomics Data Mining Conference, we attempted to discriminate between 24 disease spectra (group A) and 17 normal spectra (group B). First, we processed the raw spectra by (i) correcting for additive sinusoidal noise (periodic on the time scale) affecting most spectra, (ii) correcting for the overall baseline level, (iii) normalizing, (iv) recombining fractions, and (v) using variable-width windows for data reduction. Also, we identified a set of polymeric peaks (at multiples of 180.6 Da) that is present in several normal spectra (B1-B8). After data processing, we found the intensities at the following mass-to-charge (m/z) values to be useful discriminators: 3077, 12 886 and 74 263. Using these values, we were able to achieve an overall classification accuracy of 38/41 (92.6%). Perfect classification could be achieved by adding two additional peaks, at 2476 and 6955. We identified these values by applying a genetic algorithm to a filtered list of m/z values using Mahalanobis distance between the group means as a fitness function.
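The fitness function named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `mahalanobis_fitness`, the use of a pooled within-group covariance, and the pseudo-inverse fallback for singular covariance matrices are all assumptions made for illustration. A genetic algorithm would call such a function on each candidate subset of m/z columns and favor subsets with larger values.

```python
import numpy as np

def mahalanobis_fitness(group_a, group_b, columns):
    """Squared Mahalanobis distance between two group means.

    group_a, group_b: arrays of shape (n_spectra, n_mz_values).
    columns: indices of the candidate m/z values to score.
    Assumption: the covariance is the pooled within-group covariance.
    """
    a = np.asarray(group_a, dtype=float)[:, columns]
    b = np.asarray(group_b, dtype=float)[:, columns]
    diff = a.mean(axis=0) - b.mean(axis=0)

    n_a, n_b = a.shape[0], b.shape[0]
    # np.cov returns a 0-d array for a single column, so force 2-D.
    cov_a = np.atleast_2d(np.cov(a, rowvar=False))
    cov_b = np.atleast_2d(np.cov(b, rowvar=False))
    pooled = ((n_a - 1) * cov_a + (n_b - 1) * cov_b) / (n_a + n_b - 2)

    # Pseudo-inverse (an assumption) keeps the score defined
    # when the pooled covariance is singular.
    return float(diff @ np.linalg.pinv(pooled) @ diff)
```

A column whose group means differ by many within-group standard deviations scores high, while a column with identical group means scores zero, which is what makes the distance usable as a selection criterion.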
Pages: 1667-1672
Page count: 6