Detection of Regional Variation in Selection Intensity within Protein-Coding Genes Using DNA Sequence Polymorphism and Divergence.

Authors:
Zi-Ming Zhao
Zi-Ming Zhao
Guangdong Provincial Institute of Chinese Medicine
Foshan | China
Michael C Campbell
Michael C Campbell
University of Pennsylvania
Philadelphia | United States
Ning Li
Ning Li
The Fish Molecular Genetics and Biotechnology Laboratory
Auburn | United States
Zhang Zhang
Zhang Zhang
West China Hospital
Nanchong Shi | China
Prof. Jeffrey P Townsend, Ph.D.
Prof. Jeffrey P Townsend, Ph.D.
Yale University
Elihu Professor of Biostatistics and Ecology & Evolutionary Biology
New Haven, CT | United States

Mol Biol Evol 2017 Nov;34(11):3006-3022

Department of Biostatistics, Yale University, New Haven, CT.

Numerous approaches have been developed to infer natural selection based on the comparison of polymorphism within species and divergence between species. These methods are especially powerful for the detection of uniform selection operating across a gene. However, empirical analyses have demonstrated that regions of protein-coding genes exhibiting clusters of amino acid substitutions are subject to different levels of selection relative to other regions of the same gene. To quantify this heterogeneity of selection within coding sequences, we developed Model Averaged Site Selection via Poisson Random Field (MASS-PRF). MASS-PRF identifies an ensemble of intragenic clustering models for polymorphic and divergent sites. This ensemble of models is used within the Poisson Random Field framework to estimate selection intensity on a site-by-site basis. Using simulations, we demonstrate that MASS-PRF has high power to detect clusters of amino acid variants in small genic regions, can reliably estimate the probability of a variant occurring at each nucleotide site in sequence data and is robust to historical demographic trends and recombination. We applied MASS-PRF to human gene polymorphism derived from the 1,000 Genomes Project and divergence data from the common chimpanzee. On the basis of this analysis, we discovered striking regional variation in selection intensity, indicative of positive or negative selection, in well-defined domains of genes that have previously been associated with neurological processing, immunity, and reproduction. We suggest that amino acid-altering substitutions within these regions likely are or have been selectively advantageous in the human lineage, playing important roles in protein function.

Abstract Video

Download full-text PDF

Source
http://dx.doi.org/10.1093/molbev/msx213DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850860PMC

Still can't find the full text of the article?

We can help you send a request to the authors directly.
November 2017
319 Reads

Article Mentions


Provided by Crossref Event Data
twitter
Twitter: JeffTownsend
December 5, 2018, 9:45 pm EST
twitter
Twitter: JeffTownsend
May 5, 2018, 12:48 am EST
twitter
Twitter: Random_hike
December 17, 2017, 5:18 pm EST
twitter
Twitter: EvolutionPirate
December 17, 2017, 10:25 am EST
twitter
Twitter: ArthursEvo
December 17, 2017, 4:46 am EST
twitter
Twitter: Jlwalker97
December 17, 2017, 1:00 am EST
twitter
Twitter: Tylervkent
December 16, 2017, 11:51 pm EST
twitter
Twitter: Pandolfatto
December 16, 2017, 10:37 pm EST
twitter
Twitter: Dbasanta
December 8, 2017, 5:13 pm EST
twitter
Twitter: JeffTownsend
December 8, 2017, 5:10 pm EST
twitter
Twitter: GamerGeekNews
December 8, 2017, 5:10 pm EST
twitter
Twitter: JeffTownsend
December 8, 2017, 5:09 pm EST

Publication Analysis

Top Keywords

selection intensity
12
selection
9
amino acid
8
protein-coding genes
8
random field
8
variation selection
8
clusters amino
8
regional variation
8
poisson random
8
small genic
4
variants small
4
acid variants
4
high power
4
mass-prf high
4
genic regions
4
power detect
4
detect clusters
4
variant occurring
4
nucleotide site
4
site sequence
4

References

(Supplied by CrossRef)
Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy
Akashi et al.
Genetics 1994
Inferring weak selection from patterns of polymorphism and divergence at “silent” sites in Drosophila DNA
Akashi et al.
Genetics 1995

Bakewell et al.
2008
Genomic insights into positive selection
Biswas et al.
Trends Genet. 2006

Similar Publications