Genome and Transcriptome Assembly of the Canadian Beaver ().

G3 (Bethesda) 2017 02 9;7(2):755-773. Epub 2017 Feb 9.

The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada

The Canadian beaver () is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon-gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology.

Download full-text PDF

Source
http://dx.doi.org/10.1534/g3.116.038208DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5295618PMC
February 2017
92 Reads

Publication Analysis

Top Keywords

beaver genome
8
canadian beaver
8
genome
8
genome size
8
assembly
8
beaver
6
genes involved
4
exon-gene models
4
accuracy scaffold
4
assembly exon-gene
4
scaffolded assembly
4
completeness accuracy
4
well-represented genes
4
full-length open
4
assemblies well-represented
4
open reading
4
size completeness
4
9805 full-length
4
derived 9805
4
assembly scaffolded
4

References

(Supplied by CrossRef)
FGFs and BMP4 induce both Msx1-independent and Msx1-dependent signaling pathways in early tooth development.
Bei et al.
Development 1998

Similar Publications