Target inference from collections of genomic intervals.

Proc Natl Acad Sci U S A 2013 Jun 6;110(25):E2271-8. Epub 2013 Jun 6.

Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA.

Finding regions of the genome that are significantly recurrent in noisy data is a common but difficult problem in present-day computational biology. Cores of recurrent events (CORE) is a computational approach to solving this problem that is based on a formalized notion by which "core" intervals explain the observed data, where the number of cores is the "depth" of the explanation. Given that formalization, we implement CORE as a combinatorial optimization procedure with depth chosen from considerations of statistical significance. An important feature of CORE is its ability to explain data with cores of widely varying lengths. We examine the performance of this system with synthetic data, and then provide two demonstrations of its utility with actual data. Applying CORE to a collection of DNA copy number profiles from single cells of a given tumor, we determine tumor population phylogeny and find the features that separate subpopulations. Applying CORE to comparative genomic hybridization data from a large set of tumor samples, we define regions of recurrent copy number aberration in breast cancer.
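
To make the idea of "core" intervals explaining observed intervals more concrete, the following is a minimal sketch and not the published CORE implementation. It assumes a hypothetical association score (the fraction of an event's length covered by a core lying entirely inside it), a greedy selection rule, and a fixed `depth` supplied by the caller, whereas the abstract states that depth is chosen from considerations of statistical significance. The names `association` and `greedy_cores` are illustrative only.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end)


def association(core: Interval, event: Interval) -> float:
    """Assumed score in [0, 1]: how well `core` explains `event`.

    Zero unless the core lies entirely inside the event; otherwise the
    fraction of the event's length that the core covers.
    """
    cs, ce = core
    es, ee = event
    if es <= cs and ce <= ee and cs < ce:
        return (ce - cs) / (ee - es)
    return 0.0


def greedy_cores(events: List[Interval], depth: int) -> List[Tuple[Interval, float]]:
    """Greedily pick up to `depth` cores that best explain `events`."""
    # Candidate cores: intervals bounded by an observed start and an observed end.
    starts = sorted({s for s, _ in events})
    ends = sorted({e for _, e in events})
    candidates = [(s, e) for s in starts for e in ends if s < e]

    # Each event starts fully unexplained (weight 1.0).
    unexplained = [1.0] * len(events)
    chosen: List[Tuple[Interval, float]] = []

    for _ in range(depth):
        best, best_score = None, 0.0
        for cand in candidates:
            score = sum(w * association(cand, ev)
                        for w, ev in zip(unexplained, events))
            if score > best_score:
                best, best_score = cand, score
        if best is None:  # nothing left to explain
            break
        chosen.append((best, best_score))
        # Discount each event by how much the chosen core explains it.
        unexplained = [w * (1.0 - association(best, ev))
                       for w, ev in zip(unexplained, events)]
    return chosen


if __name__ == "__main__":
    events = [(10, 20), (12, 20), (10, 18), (50, 60), (50, 60)]
    for core, score in greedy_cores(events, depth=2):
        print(core, round(score, 3))
```

With the toy events above, the sketch selects (12, 18), the region shared by the three overlapping events, and then (50, 60). The two cores differ in length, which loosely mirrors the abstract's point about explaining data with cores of widely varying lengths.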

Source
DOI: http://dx.doi.org/10.1073/pnas.1306909110
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3690846
June 2013
