Comparing Reverse Complementary Genomic Words Based on Their Distance Distributions and Frequencies.

Interdiscip Sci 2018 Mar 6;10(1):1-11. Epub 2017 Dec 6.

Department of Mathematics and CIDMA and iBiMED and IEETA, University of Aveiro, Aveiro, Portugal.

In this work, we study reverse complementary genomic word pairs in the human DNA, by comparing both the distance distribution and the frequency of a word to those of its reverse complement. Several measures of dissimilarity between distance distributions are considered, and it is found that the peak dissimilarity works best in this setting. We report the existence of reverse complementary word pairs with very dissimilar distance distributions, as well as word pairs with very similar distance distributions even when both distributions are irregular and contain strong peaks. The association between distribution dissimilarity and frequency discrepancy is also explored, and it is speculated that symmetric pairs combining low and high values of each measure may uncover features of interest. Taken together, our results suggest that some asymmetries in the human genome go far beyond Chargaff's rules. This study uses both the complete human genome and its repeat-masked version.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s12539-017-0273-0DOI Listing
March 2018
7 Reads

Publication Analysis

Top Keywords

distance distributions
16
reverse complementary
12
word pairs
12
complementary genomic
8
human genome
8
distributions
5
distance
5
dissimilar distance
4
pairs dissimilar
4
well word
4
asymmetries human
4
distributions distributions
4
pairs distance
4
complementary word
4
distributions well
4
setting report
4
dissimilarity works
4
chargaff's rules
4
peak dissimilarity
4
works best
4

References

(Supplied by CrossRef)

V Afreixo et al.
Bioinformatics 2009

AH Tavares et al.
Proc ICPRAM 2015

DR Forsdyke et al.
Gene 2000

G Albrecht-Buehler et al.
Genomics 2007

PF Baisnée et al.
Bioinformatics 2002

B Inagaki et al.
Front Genet 2016

V Afreixo et al.
J Theor Biol 2013

V Afreixo et al.
Biostatistics 2015

G Albrecht-Buehler et al.
Proc Natl Acad Sci 2006

AH Tavares et al.
2017

Similar Publications