Inter-dinucleotide distances in the human genome: an analysis of the whole-genome and protein-coding distributions.

J Integr Bioinform 2011 Sep 15;8(3):172. Epub 2011 Sep 15.

Signal Processing Lab, IEETA, University of Aveiro, 3810-193 Aveiro, Portugal.

We study the inter-dinucleotide distance distributions in the human genome, both in the whole-genome and protein-coding regions. The inter-dinucleotide distance is defined as the distance to the next occurrence of the same dinucleotide. We consider the 16 sequences of inter-dinucleotide distances and two reading frames. Our results show a period-3 oscillation in the protein-coding inter-dinucleotide distance distributions that is absent from the whole-genome distributions. We also compare the distance distribution of each dinucleotide to a reference distribution, that of a random sequence generated with the same dinucleotide abundances, revealing the CG dinucleotide as the one with the highest cumulative relative error for the first 60 distances. Moreover, the distance distribution of each dinucleotide is compared to the distance distribution of all other dinucleotides using the Kullback-Leibler divergence. We find that the distance distribution of a dinucleotide and that of its reversed complement are very similar, hence, the divergence between them is very small. This is an interesting finding that may give evidence of a stronger parity rule than Chargaff's second parity rule.

Download full-text PDF

Source
http://dx.doi.org/10.2390/biecoll-jib-2011-172DOI Listing
September 2011
9 Reads

Publication Analysis

Top Keywords

distance distribution
16
inter-dinucleotide distance
12
distribution dinucleotide
12
whole-genome protein-coding
8
distance distributions
8
distance
8
parity rule
8
inter-dinucleotide distances
8
human genome
8
dinucleotide
6
inter-dinucleotide
5
distribution
5
dinucleotide abundances
4
cumulative relative
4
abundances revealing
4
revealing dinucleotide
4
highest cumulative
4
dinucleotide highest
4
relative error
4
distribution random
4

Similar Publications