A meta-analysis of the genomic and transcriptomic composition of complex life.

Cell Cycle 2013 Jul 6;12(13):2061-72. Epub 2013 Jun 6.

Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD Australia.

It is now clear that animal genomes are predominantly non-protein-coding, and that these sequences encode a wide array of RNA transcripts and other regulatory elements that are fundamental to the development of complex life. We have previously argued that the proportion of an animal genome that is non-protein-coding DNA (ncDNA) correlates well with its apparent biological complexity. Here we extend on that work and, using data from a total of 1,627 prokaryotic and 153 eukaryotic complete and annotated genomes, show that the proportion of ncDNA per haploid genome is significantly positively correlated with a previously published proxy of biological complexity, the number of distinct cell types. This is in contrast to the amount of the genome that encodes proteins, which we show is essentially unchanged across Metazoa. Furthermore, using a total of 179 RNA-seq data sets from nematode (47), fruit fly (72), zebrafish (20) and human (42), we show, consistent with other recent reports, that the vast majority of ncDNA in animals is transcribed. This includes more than 60 human loci previously considered "gene deserts," many of which are expressed tissue-specifically and associated with previously reported GWAS SNPs. These results suggest that ncDNA, and the ncRNAs encoded within it, may be intimately involved in the evolution, maintenance and development of complex life.

Download full-text PDF

Source
http://dx.doi.org/10.4161/cc.25134DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3737309PMC
July 2013
2 Reads

Publication Analysis

Top Keywords

complex life
12
development complex
8
biological complexity
8
positively correlated
4
genome positively
4
haploid genome
4
proportion ncdna
4
ncdna haploid
4
"gene deserts"
4
considered "gene
4
loci considered
4
complexity number
4
proxy biological
4
published proxy
4
correlated published
4
genomes proportion
4
complete annotated
4
total 1627
4
tissue-specifically associated
4
human consistent
4

Altmetric Statistics

References

(Supplied by CrossRef)
Chromosome organization and genic expression
McCLINTOCK et al.
Cold Spring Harb Symp Quant Biol 1951
The origin and behavior of mutable loci in maize
McCLINTOCK et al.
Proc Natl Acad Sci USA 1950
An integrated encyclopedia of DNA elements in the human genome
Consortium et al.
Nat Cell Biol 2012

Similar Publications