Error rates for nanopore discrimination among cytosine, methylcytosine, and hydroxymethylcytosine along individual DNA strands.

Proc Natl Acad Sci U S A 2013 Nov 28;110(47):18910-5. Epub 2013 Oct 28.

Nanopore Group, Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064.

Cytosine, 5-methylcytosine, and 5-hydroxymethylcytosine were identified during translocation of single DNA template strands through a modified Mycobacterium smegmatis porin A (M2MspA) nanopore under control of phi29 DNA polymerase. This identification was based on three consecutive ionic current states that correspond to passage of modified or unmodified CG dinucleotides and their immediate neighbors through the nanopore limiting aperture. To establish quality scores for these calls, we examined ~3,300 translocation events for 48 distinct DNA constructs. Each experiment analyzed a mixture of cytosine-, 5-methylcytosine-, and 5-hydroxymethylcytosine-bearing DNA strands that contained a marker that independently established the correct cytosine methylation status at the target CG of each molecule tested. To calculate error rates for these calls, we established decision boundaries using a variety of machine-learning methods. These error rates depended upon the identity of the bases immediately 5' and 3' of the targeted CG dinucleotide, and ranged from 1.7% to 12.2% for a single-pass read. We estimate that Q40 values (0.01% error rates) for methylation status calls could be achieved by reading single molecules 5-19 times depending upon sequence context.

Download full-text PDF

Source
http://dx.doi.org/10.1073/pnas.1310615110DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3839712PMC
November 2013

Publication Analysis

Top Keywords

error rates
16
methylation status
8
dna strands
8
dna
5
limiting aperture
4
molecule tested
4
single-pass read
4
nanopore limiting
4
aperture establish
4
establish quality
4
17% 122%
4
calls examined
4
target molecule
4
scores calls
4
quality scores
4
machine-learning methods
4
methods error
4
122% single-pass
4
dinucleotides neighbors
4
tested calculate
4

Similar Publications