KAAS: an automatic genome annotation and pathway reconstruction server

被引:3277
作者
Moriya, Yuki [1 ]
Itoh, Masumi [1 ]
Okuda, Shujiro [1 ]
Yoshizawa, Akiyasu C. [1 ]
Kanehisa, Minoru [1 ]
机构
[1] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Kyoto 6110011, Japan
基金
日本科学技术振兴机构;
关键词
D O I
10.1093/nar/gkm321
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The number of complete and draft genomes is rapidly growing in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with the KEGG orthology (KO) identifiers, or the K numbers, based on the best hit information using Smith Waterman scores as well as by the manual curation. Each K number represents an ortholog group of genes, and it is directly linked to an object in the KEGG pathway map or the BRITE functional hierarchy. Here, we have developed a web-based server called KAAS (KEGG Automatic Annotation Server: http://www.genome.jp/kegg/ kaas/) i.e. an implementation of a rapid method to automatically assign K numbers to genes in the genome, enabling reconstruction of KEGG pathways and BRITE hierarchies. The method is based on sequence similarities, bi-directional best hit information and some heuristics, and has achieved a high degree of accuracy when compared with the manually curated KEGG GENES database.
引用
收藏
页码:W182 / W185
页数:4
相关论文
共 10 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   From genomics to chemical genomics: new developments in KEGG [J].
Kanehisa, Minoru ;
Goto, Susumu ;
Hattori, Masahiro ;
Aoki-Kinoshita, Kiyoko F. ;
Itoh, Masumi ;
Kawashima, Shuichi ;
Katayama, Toshiaki ;
Araki, Michihiro ;
Hirakawa, Mika .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D354-D357
[5]   RAPID AND SENSITIVE PROTEIN SIMILARITY SEARCHES [J].
LIPMAN, DJ ;
PEARSON, WR .
SCIENCE, 1985, 227 (4693) :1435-1441
[6]   Enzyme function less conserved than anticipated [J].
Rost, B .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 318 (02) :595-608
[7]   IDENTIFICATION OF COMMON MOLECULAR SUBSEQUENCES [J].
SMITH, TF ;
WATERMAN, MS .
JOURNAL OF MOLECULAR BIOLOGY, 1981, 147 (01) :195-197
[8]   A genomic perspective on protein families [J].
Tatusov, RL ;
Koonin, EV ;
Lipman, DJ .
SCIENCE, 1997, 278 (5338) :631-637
[9]   The COG database: new developments in phylogenetic classification of proteins from complete genomes [J].
Tatusov, RL ;
Natale, DA ;
Garkavtsev, IV ;
Tatusova, TA ;
Shankavaram, UT ;
Rao, BS ;
Kiryutin, B ;
Galperin, MY ;
Fedorova, ND ;
Koonin, EV .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :22-28
[10]   How well is enzyme function conserved as a function of pairwise sequence identity? [J].
Tian, WD ;
Skolnick, J .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 333 (04) :863-882