CDD: specific functional annotation with the Conserved Domain Database

Cited by: 875
Authors
Marchler-Bauer, Aron [1]
Anderson, John B. [1]
Chitsaz, Farideh [1]
Derbyshire, Myra K. [1]
DeWeese-Scott, Carol [1]
Fong, Jessica H. [1]
Geer, Lewis Y. [1]
Geer, Renata C. [1]
Gonzales, Noreen R. [1]
Gwadz, Marc [1]
He, Siqian [1]
Hurwitz, David I. [1]
Jackson, John D. [1]
Ke, Zhaoxi [1]
Lanczycki, Christopher J. [1]
Liebert, Cynthia A. [1]
Liu, Chunlei [1]
Lu, Fu [1]
Lu, Shennan [1]
Marchler, Gabriele H. [1]
Mullokandov, Mikhail [1]
Song, James S. [1]
Tasneem, Asba [1]
Thanki, Narmada [1]
Yamashita, Roxanne A. [1]
Zhang, Dachuan [1]
Zhang, Naigong [1]
Bryant, Stephen H. [1]
Affiliations
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
Funding
National Institutes of Health (USA);
Keywords
ALIGNMENTS;
DOI
10.1093/nar/gkn845
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
NCBI's Conserved Domain Database (CDD) is a collection of multiple sequence alignments and derived database search models, which represent protein domains conserved in molecular evolution. The collection can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml, and is also part of NCBI's Entrez query and retrieval system, crosslinked to numerous other resources. CDD provides annotation of domain footprints and conserved functional sites on protein sequences. Precalculated domain annotation can be retrieved for protein sequences tracked in NCBI's Entrez system, and CDD's collection of models can be queried with novel protein sequences via the CD-Search service at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi. Starting with the latest version of CDD, v2.14, information from redundant and homologous domain models is summarized at a superfamily level, and domain annotation on proteins is flagged as either 'specific' (identifying molecular function with high confidence) or as 'non-specific' (identifying superfamily membership only).
Pages: D205-D210
Number of pages: 6
Related papers
13 in total
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[3]   The Pfam protein families database [J].
Finn, Robert D. ;
Tate, John ;
Mistry, Jaina ;
Coggill, Penny C. ;
Sammut, Stephen John ;
Hotz, Hans-Rudolf ;
Ceric, Goran ;
Forslund, Kristoffer ;
Eddy, Sean R. ;
Sonnhammer, Erik L. L. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D281-D288
[4]  
FONG JH, 2008, BMC RES NOT IN PRESS
[5]   CDART: Protein homology by domain architecture [J].
Geer, LY ;
Domrachev, M ;
Lipman, DJ ;
Bryant, SH .
GENOME RESEARCH, 2002, 12 (10) :1619-1623
[6]   SMART 5: domains in the context of genomes and networks [J].
Letunic, Ivica ;
Copley, Richard R. ;
Pils, Birgit ;
Pinkert, Stefan ;
Schultz, Joerg ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D257-D260
[7]  
MARCHLER B, 2005, NUCLEIC ACIDS RES, V32, pW327
[8]   CDD: a curated Entrez database of conserved domain alignments [J].
Marchler-Bauer, A ;
Anderson, JB ;
DeWeese-Scott, C ;
Fedorova, ND ;
Geer, LY ;
He, SQ ;
Hurwitz, DI ;
Jackson, JD ;
Jacobs, AR ;
Lanczycki, CJ ;
Liebert, CA ;
Liu, CL ;
Madej, T ;
Marchler, GH ;
Mazumder, R ;
Nikolskaya, AN ;
Panchenko, AR ;
Rao, BS ;
Shoemaker, BA ;
Simonyan, V ;
Song, JS ;
Thiessen, PA ;
Vasudevan, S ;
Wang, YL ;
Yamashita, RA ;
Yin, JJ ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :383-387
[9]   CDD: a conserved domain database for protein classification [J].
Marchler-Bauer, A ;
Anderson, JB ;
Cherukuri, PF ;
DeWeese-Scott, C ;
Geer, LY ;
Gwadz, M ;
He, SQ ;
Hurwitz, DI ;
Jackson, JD ;
Ke, ZX ;
Lanczycki, CJ ;
Liebert, CA ;
Liu, CL ;
Lu, F ;
Marchler, GH ;
Mullokandov, M ;
Shoemaker, BA ;
Simonyan, V ;
Song, JS ;
Thiessen, PA ;
Yamashita, RA ;
Yin, JJ ;
Zhang, DC ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D192-D196
[10]   CDD: a database of conserved domain alignments with links to domain three-dimensional structure [J].
Marchler-Bauer, A ;
Panchenko, AR ;
Shoemaker, BA ;
Thiessen, PA ;
Geer, LY ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :281-283