Protein interactions - Two methods for assessment of the reliability of high throughput observations

被引:466
作者
Deane, CM [1 ]
Salwinski, L [1 ]
Xenarios, I [1 ]
Eisenberg, D [1 ]
机构
[1] Univ Calif Los Angeles, Howard Hughes Med Inst, Inst Mol Biol, UCLA DOE Lab Struct Biol & Mol Med, Los Angeles, CA 90095 USA
关键词
D O I
10.1074/mcp.M100037-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High throughput methods for detecting protein interactions require assessment of their accuracy. We present two forms of computational assessment. The first method is the expression profile reliability (EPR) index. The EPR index estimates the biologically relevant fraction of protein interactions detected in a high throughput screen. It does so by comparing the RNA expression profiles for the proteins whose interactions are found in the screen with expression profiles for known interacting and non-interacting pairs of proteins. The second form of assessment is the paralogous verification method (PVM). This method judges an interaction likely if the putatively interacting pair has paralogs that also interact. In contrast to the EPR index, which evaluates datasets of interactions, PVM scores individual interactions. On a test set, PVM identifies correctly 40% of true interactions with a false positive rate of similar to1%. EPR and PVM were applied to the Database of Interacting Proteins (DIP), a large and diverse collection of protein-protein interactions that contains over 8000 Saccharomyces cerevisiae pairwise protein interactions. Using these two methods, we estimate that similar to50% of them are reliable, and with the aid of PVM we identify confidently 3003 of them. Web servers for both the PVM and EPR methods are available on the DIP website (dip. doe-mbi.ucia.edu/Services.cgi).
引用
收藏
页码:349 / 356
页数:8
相关论文
共 36 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   BIND - The Biomolecular Interaction Network Database [J].
Bader, GD ;
Donaldson, I ;
Wolting, C ;
Ouellette, BFF ;
Pawson, T ;
Hogue, CWV .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :242-245
[3]  
Blackford L. S., 1999, LAPACK USERS GUIDE
[4]   YPD™, PombePD™ and WormPD™:: model organism volumes of the BioKnowledge™ Library, an integrated resource for protein information [J].
Costanzo, MC ;
Crawford, ME ;
Hirschman, JE ;
Kranz, JE ;
Olsen, P ;
Robertson, LS ;
Skrzypek, MS ;
Braun, BR ;
Hopkins, KL ;
Kondu, P ;
Lengieza, C ;
Lew-Smith, JE ;
Tillberg, M ;
Garrels, JI .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :75-79
[5]   The Yeast Proteome Database (YPD) and Caenorhabditis elegans Proteome Database (WormPD):: comprehensive resources for the organization and comparison of model organism protein information [J].
Costanzo, MC ;
Hogan, JD ;
Cusick, ME ;
Davis, BP ;
Fancher, AM ;
Hodges, PE ;
Kondu, P ;
Lengieza, C ;
Lew-Smith, JE ;
Lingner, C ;
Roberg-Perez, KJ ;
Tillberg, M ;
Brooks, JE ;
Garrels, JI .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :73-76
[6]   Observing the living genome [J].
Ferea, TL ;
Brown, PO .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 1999, 9 (06) :715-722
[7]  
Fromont-Racine M, 2000, YEAST, V17, P95, DOI 10.1002/1097-0061(20000630)17:2<95::AID-YEA16>3.0.CO
[8]  
2-H
[9]   Toward a functional analysis of the yeast genome through exhaustive two-hybrid screens [J].
FromontRacine, M ;
Rain, JC ;
Legrain, P .
NATURE GENETICS, 1997, 16 (03) :277-282
[10]   Genomic expression programs in the response of yeast cells to environmental changes [J].
Gasch, AP ;
Spellman, PT ;
Kao, CM ;
Carmel-Harel, O ;
Eisen, MB ;
Storz, G ;
Botstein, D ;
Brown, PO .
MOLECULAR BIOLOGY OF THE CELL, 2000, 11 (12) :4241-4257