CDART: Protein homology by domain architecture

Cited by: 557
Authors
Geer, LY [1]
Domrachev, M [1]
Lipman, DJ [1]
Bryant, SH [1]
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
DOI
10.1101/gr.278202
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The Conserved Domain Architecture Retrieval Tool (CDART) performs similarity searches of the NCBI Entrez Protein Database based on domain architecture, defined as the sequential order of conserved domains in proteins. The algorithm finds protein similarities across significant evolutionary distances using sensitive protein domain profiles rather than by direct sequence similarity. Proteins similar to a query protein are grouped and scored by architecture. Relying on domain profiles allows CDART to be fast, and, because it relies on annotated functional domains, informative. Domain profiles are derived from several collections of domain definitions that include functional annotation. Searches can be further refined by taxonomy and by selecting domains of interest. CDART is available at http://www.ncbi.nlm.nih.gov/Structure/lexington/lexington.cgi.
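Illustrative note: the grouping-and-scoring idea in the abstract can be sketched in a few lines of Python. This is only a toy sketch under stated assumptions, not CDART's actual algorithm or scoring. It assumes each protein has already been annotated with an ordered list of domain names, and it scores an architecture by the number of distinct domains it shares with the query, which is a simple stand-in chosen for illustration. The function names, protein identifiers, and domain assignments are all hypothetical.

```python
from collections import defaultdict


def group_by_architecture(proteins):
    """Group protein IDs by architecture (the ordered tuple of domain names)."""
    groups = defaultdict(list)
    for protein_id, domains in proteins.items():
        groups[tuple(domains)].append(protein_id)
    return dict(groups)


def score_architecture(query_domains, architecture):
    """Toy score (an assumption): distinct domains shared with the query."""
    return len(set(query_domains) & set(architecture))


def rank_architectures(query_domains, proteins):
    """Return (score, architecture, protein IDs) for architectures sharing
    at least one domain with the query, best score first."""
    ranked = []
    for architecture, ids in group_by_architecture(proteins).items():
        score = score_architecture(query_domains, architecture)
        if score > 0:
            ranked.append((score, architecture, sorted(ids)))
    # Sort by descending score, then by architecture for a stable order.
    ranked.sort(key=lambda item: (-item[0], item[1]))
    return ranked


if __name__ == "__main__":
    # Hypothetical toy annotations; not real database records.
    toy_proteins = {
        "prot1": ["A", "B"],
        "prot2": ["A", "B"],
        "prot3": ["B", "C"],
        "prot4": ["D"],
    }
    for score, architecture, ids in rank_architectures(["A", "B"], toy_proteins):
        print(score, "-".join(architecture), ids)
```

Proteins with identical domain order fall into one group, so a result list has one row per architecture rather than one row per sequence. That mirrors the grouping described in the abstract, while the scoring rule here remains a placeholder.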
Pages: 1619-1623
Page count: 5