Systematic identification of functional orthologs based on protein network comparison

Citations: 137
Authors
Bandyopadhyay, S
Sharan, R [1]
Ideker, T
Affiliations
[1] Univ Calif San Diego, Program Bioinformat, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Bioengn, La Jolla, CA 92093 USA
[3] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
DOI
10.1101/gr.4526006
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Annotating protein function across species is an important task that is often complicated by the presence of large paralogous gene families. Here, we report a novel strategy for identifying functionally related proteins that supplements sequence-based comparisons with information on conserved protein-protein interactions. First, the protein interaction networks of two species are aligned by assigning proteins to sequence homology clusters using the Inparanoid algorithm. Next, probabilistic inference is performed on the aligned networks to identify pairs of proteins, one from each species, that are likely to retain the same function based on conservation of their interacting partners. Applying this method to Drosophila melanogaster and Saccharomyces cerevisiae, we analyze 121 cases for which functional orthology assignment is ambiguous when sequence similarity is used alone. In 61 of these cases, the network supports a different protein pair than that favored by sequence comparisons. These results suggest that network analysis can be used to provide a key source of information for refining sequence-based homology searches.
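The idea of scoring candidate pairs by "conservation of their interacting partners" can be illustrated with a toy sketch. This is not the probabilistic inference used in the paper; it is a much simpler count of shared partner clusters, shown only to make the intuition concrete. All protein names, cluster labels, and function names below are hypothetical, and the homology clusters are assumed to be precomputed (for example, by a tool such as Inparanoid).

```python
# Toy illustration only: a plain count of conserved interaction partners,
# NOT the probabilistic model described in the abstract.

def conserved_partner_count(a, b, neighbors_a, neighbors_b, cluster_of):
    """Number of homology clusters containing both a partner of `a`
    (species A) and a partner of `b` (species B)."""
    clusters_a = {cluster_of[n] for n in neighbors_a.get(a, ()) if n in cluster_of}
    clusters_b = {cluster_of[n] for n in neighbors_b.get(b, ()) if n in cluster_of}
    return len(clusters_a & clusters_b)


def rank_candidates(a, candidates, neighbors_a, neighbors_b, cluster_of):
    """Rank species-B candidates for protein `a`, highest count first
    (ties broken alphabetically by candidate name)."""
    scored = [
        (conserved_partner_count(a, b, neighbors_a, neighbors_b, cluster_of), b)
        for b in candidates
    ]
    return sorted(scored, key=lambda t: (-t[0], t[1]))


# Hypothetical data: Y* are yeast proteins, F* are fly proteins.
# Y1/Y2 and F1/F2 sit in one ambiguous homology cluster.
yeast_neighbors = {"Y1": ["YA", "YB"], "Y2": ["YC"]}
fly_neighbors = {"F1": ["FC"], "F2": ["FA", "FB"]}
cluster_of = {
    "YA": "CA", "FA": "CA",
    "YB": "CB", "FB": "CB",
    "YC": "CC", "FC": "CC",
}

ranking = rank_candidates("Y1", ["F1", "F2"], yeast_neighbors, fly_neighbors, cluster_of)
print(ranking)
```

In this toy setup, F2 ranks above F1 as a partner for Y1 because two of Y1's interaction partners have homologs among F2's partners. A raw count like this ignores network noise and the dependence between neighboring assignments, which is what a probabilistic treatment is meant to address.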
Pages: 428-435
Page count: 8