Whole-genome phylogeny of Escherichia coli/Shigella group by feature frequency profiles (FFPs).

Proc Natl Acad Sci U S A 2011 May 2;108(20):8329-34. Epub 2011 May 2.

Department of Informatics, J. Craig Venter Institute, Rockville, MD 20850, USA.

A whole-genome phylogeny of the Escherichia coli/Shigella group was constructed by using the feature frequency profile (FFP) method. This alignment-free approach uses the frequencies of l-mer features of whole genomes to infer phylogenetic distances. We present two phylogenies that accentuate different aspects of E. coli/Shigella genomic evolution: (i) one based on the compositions of all possible features of length l = 24 (∼8.4 million features), which are likely to reveal the phenetic grouping and relationship among the organisms, and (ii) the other based on the compositions of core features with low frequency and low variability (∼0.56 million features), which account for ∼69% of all commonly shared features among the 38 taxa examined and are likely to carry a genome-wide lineal evolutionary signal. Shigella appears as a single clade when all possible features are used without filtering of noncore features. However, results using core features show that Shigella consists of at least two distantly related subclades, implying that the subclades evolved into a single clade because of a high degree of convergence influenced by mobile genetic elements and niche adaptation. In both FFP trees, the basal group of the E. coli/Shigella phylogeny is the B2 phylogroup, which contains primarily uropathogenic strains, suggesting that the E. coli/Shigella ancestor was likely a facultative or opportunistic pathogen. The extant commensal strains diverged relatively late and appear to be the result of reductive evolution of genomes. We also identify clade-distinguishing features and their associated genomic regions within each phylogroup. Such features may provide useful information for understanding the evolution of the groups and for quick diagnostic identification of each phylogroup.
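
The idea of comparing genomes by their l-mer frequencies can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not name the distance measure, so the Jensen-Shannon divergence used here is an assumption. The function names (`ffp`, `js_divergence`, `core_features`) are hypothetical, and so are the thresholds `max_mean_freq` and `max_cv` and the use of the coefficient of variation as a stand-in for "low variability". The toy sequences and l = 3 are for readability only; the abstract uses l = 24 on whole genomes.

```python
from collections import Counter
from math import log2
from statistics import mean, pstdev


def ffp(sequence, l):
    """Return the relative frequency of every l-mer observed in the sequence."""
    seq = sequence.upper()
    if len(seq) < l:
        raise ValueError("sequence is shorter than the feature length l")
    counts = Counter(seq[i:i + l] for i in range(len(seq) - l + 1))
    total = sum(counts.values())
    return {feature: n / total for feature, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two sparse profiles.

    Assumed distance measure; the result lies between 0 and 1.
    """
    def kl_to_midpoint(a, b):
        # KL divergence of a from the midpoint distribution (a + b) / 2.
        return sum(
            pa * log2(pa / (0.5 * (pa + b.get(feature, 0.0))))
            for feature, pa in a.items()
        )
    return 0.5 * kl_to_midpoint(p, q) + 0.5 * kl_to_midpoint(q, p)


def core_features(profiles, max_mean_freq, max_cv):
    """Select features shared by all profiles with low frequency and low variability.

    Both thresholds are hypothetical illustration parameters.
    """
    shared = set.intersection(*(set(p) for p in profiles))
    core = set()
    for feature in shared:
        freqs = [p[feature] for p in profiles]
        m = mean(freqs)  # positive, because the feature occurs in every profile
        if m <= max_mean_freq and pstdev(freqs) / m <= max_cv:
            core.add(feature)
    return core


# Toy usage: three made-up sequences stand in for genomes.
genomes = {
    "taxon_a": "ACGTACGTTGCAACGT",
    "taxon_b": "ACGTACGATGCAACGT",
    "taxon_c": "TTTTGGGGCCCCAAAA",
}
profiles = {name: ffp(seq, 3) for name, seq in genomes.items()}
names = sorted(profiles)
for i, a in enumerate(names):
    for b in names[i + 1:]:
        print(a, b, round(js_divergence(profiles[a], profiles[b]), 4))
```

The resulting pairwise distance matrix could then be passed to a standard tree-building method such as neighbor joining. Restricting the profiles to the output of `core_features` before computing distances mirrors the second, core-feature phylogeny described above.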

Source
DOI: http://dx.doi.org/10.1073/pnas.1105168108
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3100984
