Comprehensive de novo structure prediction in a systems-biology context for the archaea Halobacterium sp. NRC-1 (art. no. R52)

Cited by: 36
Authors
Bonneau, R [1]
Baliga, NS [1]
Deutsch, EW [1]
Shannon, P [1]
Hood, L [1]
Affiliations
[1] Inst Syst Biol, Seattle, WA 98103 USA
DOI
10.1186/gb-2004-5-8-r52
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Large fractions of all fully sequenced genomes code for proteins of unknown function. Annotating these proteins remains a critical bottleneck for systems biology and is crucial to understanding the biological relevance of genome-wide changes in mRNA and protein expression, and of protein-protein and protein-DNA interactions. The work reported here demonstrates that de novo structure prediction is now a viable option for providing general function information for many proteins of unknown function.

Results: We have used Rosetta de novo structure prediction to predict three-dimensional structures for 1,185 proteins and protein domains (<150 residues in length) found in Halobacterium NRC-1, a widely studied halophilic archaeon. Predicted structures were searched against the Protein Data Bank to identify fold similarities and extrapolate putative functions. They were analyzed in the context of a predicted association network composed of several sources of functional associations: predicted protein interactions, predicted operons, phylogenetic profile similarity and domain fusion. To illustrate this approach, we highlight three cases where our combined procedure has provided novel insights into chemotaxis, possible prophage remnants in Halobacterium NRC-1 and archaeal transcriptional regulators.

Conclusions: Simultaneous analysis of the association network, coordinated mRNA level changes in microarray experiments and genome-wide structure prediction has allowed us to glean significant biological insights into the roles of several Halobacterium NRC-1 proteins of previously unknown function, and to significantly reduce the number of proteins encoded in the genome of this haloarchaeon for which no annotation is available.
Pages: 15