Approaches to dimensionality reduction in proteomic biomarker studies

被引:111
作者
Hilario, Melanie [1 ]
Kalousis, Alexandros [1 ]
机构
[1] Univ Geneva, Artificial Intelligence Lab, CH-1211 Geneva 4, Switzerland
关键词
proteomics; mass spectra; biomarkers; dimensionality reduction; feature transformation; feature selection;
D O I
10.1093/bib/bbn005
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Mass-spectra based proteomic profiles have received widespread attention as potential tools for biomarker discovery and early disease diagnosis. A major data-analytical problem involved is the extremely high dimensionality (i.e. number of features or variables) of proteomic data, in particular when the sample size is small. This article reviews dimensionality reduction methods that have been used in proteomic biomarker studies. It then focuses on the problem of selecting the most appropriate method for a specific task or dataset, and proposes method combination as a potential alternative to single-method selection. Finally, it points out the potential of novel dimension reduction techniques, in particular those that incorporate domain knowledge through the use of informative priors or causal inference.
引用
收藏
页码:102 / 118
页数:17
相关论文
共 97 条
[1]  
Achlioptas D., 2001, P 20 ACM SIGMOD SIGA, P274, DOI DOI 10.1145/375551.375608
[2]  
Adam BL, 2002, CANCER RES, V62, P3609
[3]  
Aliferis C F, 2003, AMIA Annu Symp Proc, P21
[4]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[5]  
[Anonymous], [No title captured]
[6]  
[Anonymous], PATTERN CLASSIFICATI
[7]   A comprehensive approach to the analysis of matrix-assisted laser desorption/ionization-time of flight proteomics spectra from serum samples [J].
Baggerly, KA ;
Morris, JS ;
Wang, J ;
Gold, D ;
Xiao, LC ;
Coombes, KR .
PROTEOMICS, 2003, 3 (09) :1667-1672
[8]   Diagnostic potential of serum proteomic patterns in prostate cancer [J].
Bañez, LL ;
Prasanna, P ;
Sun, L ;
Ali, A ;
Zou, ZQ ;
Adam, BL ;
McLeod, DG ;
Moul, JW ;
Srivastava, S .
JOURNAL OF UROLOGY, 2003, 170 (02) :442-446
[9]   Model order selection for bio-molecular data clustering [J].
Bertoni, Alberto ;
Valentini, Giorgio .
BMC BIOINFORMATICS, 2007, 8 (Suppl 2)
[10]   A robust meta-classification strategy for cancer detection from MS data [J].
Bhanot, G ;
Alexe, G ;
Venkataraghavan, B ;
Levine, AJ .
PROTEOMICS, 2006, 6 (02) :592-604