Correlation between gene expression and GO semantic similarity

被引:163
作者
Sevilla, JL
Segura, V
Podhorski, A
Guruceaga, E
Mato, JM
Martínez-Cruz, LA
Corrales, FJ
Rubio, A
机构
[1] Strathmore Univ, Nairobi 00200, Kenya
[2] CEIT & Tecnun, Dept Bioinformat, San Sebastian 20018, Spain
[3] CIC Biogune, Zamudio, Vizcaya, Spain
[4] Univ Navarra, Dept Internal Med, Unit Prote Genom & Bioinformat, Pamplona 31008, Spain
关键词
expression analysis; gene ontology; semantic similarity;
D O I
10.1109/TCBB.2005.50
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This research analyzes some aspects of the relationship between gene expression, gene function, and gene annotation. Many recent studies are implicitly based on the assumption that gene products that are biologically and functionally related would maintain this similarity both in their expression profiles as well as in their Gene Ontology (GO) annotation. We analyze how accurate this assumption proves to be using real publicly available data. We also aim to validate a measure of semantic similarity for GO annotation. We use the Pearson correlation coefficient and its absolute value as a measure of similarity between expression profiles of gene products. We explore a number of semantic similarity measures (Resnik, Jiang, and Lin) and compute the similarity between gene products annotated using the GO. Finally, we compute correlation coefficients to compare gene expression similarity against GO semantic similarity. Our results suggest that the Resnik similarity measure outperforms the others and seems better suited for use in Gene Ontology. We also deduce that there seems to be correlation between semantic similarity in the GO annotation and gene expression for the three GO ontologies. We show that this correlation is negligible up to a certain semantic similarity value; then, for higher similarity values, the relationship trend becomes almost linear. These results can be used to augment the knowledge provided by clustering algorithms and in the development of bioinformatic tools for finding and characterizing gene products.
引用
收藏
页码:330 / 338
页数:9
相关论文
共 28 条
[1]   FatiGO:: a web tool for finding significant associations of Gene Ontology terms with groups of genes [J].
Al-Shahrour, F ;
Díaz-Uriarte, R ;
Dopazo, J .
BIOINFORMATICS, 2004, 20 (04) :578-580
[2]  
[Anonymous], CA0395 DUBL CIT U SC
[3]  
[Anonymous], INT J LEXICOGRAPHY
[4]  
[Anonymous], P WORKSH WORDNET OTH
[5]  
AZUAJE F, 2004, P IEEE 4 S BIOINF BI
[6]   The gene ontology annotation (GOA) project: Implementation of GO in SWISS-PROT, TrEMBL, and InterPro [J].
Camon, E ;
Magrane, M ;
Barrell, D ;
Binns, D ;
Fleischmann, W ;
Kersey, P ;
Mulder, N ;
Oinn, T ;
Maslen, J ;
Cox, A ;
Apweiler, R .
GENOME RESEARCH, 2003, 13 (04) :662-672
[7]  
Couto FM, 2003, IMPLEMENTATION FUNCT
[8]   DNA microarrays in drug discovery and development [J].
Debouck, C ;
Goodfellow, PN .
NATURE GENETICS, 1999, 21 (Suppl 1) :48-50
[9]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[10]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537