The complete genome sequence of Escherichia coli K-12

被引:5884
作者
Blattner, FR
Plunkett, G
Bloch, CA
Perna, NT
Burland, V
Riley, M
ColladoVides, J
Glasner, JD
Rode, CK
Mayhew, GF
Gregor, J
Davis, NW
Kirkpatrick, HA
Goeden, MA
Rose, DJ
Mau, B
Shao, Y
机构
[1] UNIV MICHIGAN, SCH MED, DEPT PEDIAT, ANN ARBOR, MI 48105 USA
[2] FMC BIOPROD, ROCKLAND, ME 04841 USA
[3] MARINE BIOL LABS, WOODS HOLE, MA 02543 USA
[4] UNIV NACL AUTONOMA MEXICO, CTR INVEST FIJAC NITROGENO, CUERNAVACA 62100, MORELOS, MEXICO
关键词
D O I
10.1126/science.277.5331.1453
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oigonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.
引用
收藏
页码:1453 / +
页数:1
相关论文
共 105 条
[1]   CLONING, NUCLEOTIDE-SEQUENCE, AND EXPRESSION OF THE PASTEURELLA-HAEMOLYTICA A1 GLYCOPROTEASE GENE [J].
ABDULLAH, KM ;
LO, RYC ;
MELLORS, A .
JOURNAL OF BACTERIOLOGY, 1991, 173 (18) :5597-5603
[2]  
Aiba H, 1996, DNA Res, V3, P363, DOI 10.1093/dnares/3.6.363
[3]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[4]  
[Anonymous], 1996, ESCHERICHIA COLI SAL
[5]  
Bachellier S, 1996, CELLULAR MOL BIOL, VII, P2012
[6]  
BACHMANN BJ, 1996, CELLULAR MOL BIOL, V2, P2460
[7]   The SWISS-PROT protein sequence data bank and its new supplement TREMBL [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 1996, 24 (01) :21-25
[8]   DNA MISMATCH CORRECTION BY VERY SHORT PATCH REPAIR MAY HAVE ALTERED THE ABUNDANCE OF OLIGONUCLEOTIDES IN THE ESCHERICHIA-COLI GENOME [J].
BHAGWAT, AS ;
MCCLELLAND, M .
NUCLEIC ACIDS RESEARCH, 1992, 20 (07) :1663-1668
[9]   SIGNIFICANT DISPERSED RECURRENT DNA-SEQUENCES IN THE ESCHERICHIA-COLI GENOME - SEVERAL NEW GROUPS [J].
BLAISDELL, BE ;
RUDD, KE ;
MATIN, A ;
KARLIN, S .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 229 (04) :833-848
[10]   BIOLOGICAL FRONTIERS [J].
BLATTNER, FR .
SCIENCE, 1983, 222 (4625) :719-720