The complete genome sequence of Escherichia coli K-12.

Science 1997 Sep;277(5331):1453-62

Laboratory of Genetics, University of Wisconsin-Madison, 445 Henry Mall, Madison, WI 53706, USA.

The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.

Download full-text PDF

Source
http://dx.doi.org/10.1126/science.277.5331.1453DOI Listing
September 1997
4 Reads

Publication Analysis

Top Keywords

escherichia coli
8
sequence escherichia
8
coli k-12
8
distributed gene
4
elements phage
4
narrowly distributed
4
gene families
4
families families
4
genes coli
4
families genes
4
respect local
4
well narrowly
4
remnants patches
4
guanines oligonucleotides
4
ubiquitous well
4
microbes reveals
4
replication recombination
4
phage remnants
4
reveals ubiquitous
4
evident largest
4

Similar Publications