The Pfam protein families database

被引:1508
作者
Finn, Robert D. [1 ]
Mistry, Jaina [1 ]
Tate, John [1 ]
Coggill, Penny [1 ]
Heger, Andreas [2 ]
Pollington, Joanne E. [1 ]
Gavin, O. Luke [1 ]
Gunasekaran, Prasad [1 ]
Ceric, Goran [3 ]
Forslund, Kristoffer [4 ]
Holm, Liisa [5 ,6 ]
Sonnhammer, Erik L. L. [4 ]
Eddy, Sean R. [3 ]
Bateman, Alex [1 ]
机构
[1] Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England
[2] Univ Oxford, Dept Physiol Anat & Genet, MRC Funct Genom Unit, Oxford, England
[3] Howard Hughes Med Inst, Ashburn, VA 20147 USA
[4] Stockholm Univ, Stockholm Bioinformat Ctr, SE-10691 Stockholm, Sweden
[5] Univ Helsinki, Inst Biotechnol, Helsinki 00014, Finland
[6] Univ Helsinki, Dept Biol & Environm Sci, FIN-00014 Helsinki, Finland
基金
芬兰科学院; 英国生物技术与生命科学研究理事会; 英国惠康基金;
关键词
TREES; BLAST;
D O I
10.1093/nar/gkp985
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Pfam is a widely used database of protein families and domains. This article describes a set of major updates that we have implemented in the latest release (version 24.0). The most important change is that we now use HMMER3, the latest version of the popular profile hidden Markov model package. This software is similar to 100 times faster than HMMER2 and is more sensitive due to the routine use of the forward algorithm. The move to HMMER3 has necessitated numerous changes to Pfam that are described in detail. Pfam release 24.0 contains 11 912 families, of which a large number have been significantly updated during the past two years. Pfam is available via servers in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/).
引用
收藏
页码:D211 / D222
页数:12
相关论文
共 24 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[3]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[4]   SCOOP: a simple method for identification of novel protein superfamily relationships [J].
Bateman, Alex ;
Finn, Robert D. .
BIOINFORMATICS, 2007, 23 (07) :809-814
[5]   ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons [J].
Corpet, F ;
Servant, F ;
Gouzy, J ;
Kahn, D .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :267-269
[6]   BioLit: integrating biological literature with databases [J].
Fink, J. Lynn ;
Kushch, Sergey ;
Williams, Parker R. ;
Bourne, Philip E. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :W385-W389
[7]   Pfam:: clans, web tools and services [J].
Finn, Robert D. ;
Mistry, Jaina ;
Schuster-Bockler, Benjamin ;
Griffiths-Jones, Sam ;
Hollich, Volker ;
Lassmann, Timo ;
Moxon, Simon ;
Marshall, Mhairi ;
Khanna, Ajay ;
Durbin, Richard ;
Eddy, Sean R. ;
Sonnhammer, Erik L. L. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D247-D251
[8]   Rfam: updates to the RNA families database [J].
Gardner, Paul P. ;
Daub, Jennifer ;
Tate, John G. ;
Nawrocki, Eric P. ;
Kolbe, Diana L. ;
Lindgreen, Stinus ;
Wilkinson, Adam C. ;
Finn, Robert D. ;
Griffiths-Jones, Sam ;
Eddy, Sean R. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D136-D140
[9]  
Heger A, 2005, NUCLEIC ACIDS RES, V33, pD188
[10]   PairsDB atlas of protein sequence space [J].
Heger, Andreas ;
Korpelainen, Eija ;
Hupponen, Taavi ;
Mattila, Kimmo ;
Ollikainen, Vesa ;
Holm, Liisa .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D276-D280