GenomeTester4: a toolkit for performing basic set operations - union, intersection and complement on k-mer lists.

Gigascience 2015 3;4:58. Epub 2015 Dec 3.

Department of Bioinformatics, University of Tartu, Riia 23, Tartu, 51010 Estonia ; Estonian Biocentre, Riia 23B, Tartu, 51010 Estonia.

Background: K-mer-based methods of genome analysis have attracted great interest because they do not require genome assembly and can be performed directly on sequencing reads. Many analysis tasks require one to compare k-mer lists from different sequences to find words that are either unique to a specific sequence or common to many sequences. However, no stand-alone k-mer analysis tool currently allows one to perform these algebraic set operations.

Findings: We have developed the GenomeTester4 toolkit, which contains a novel tool GListCompare for performing union, intersection and complement (difference) set operations on k-mer lists. We provide examples of how these general operations can be combined to solve a variety of biological analysis tasks.

Conclusions: GenomeTester4 can be used to simplify k-mer list manipulation for many biological analysis tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1186/s13742-015-0097-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4669650PMC
July 2016
2 Reads

Publication Analysis

Top Keywords

k-mer lists
12
set operations
8
intersection complement
8
biological analysis
8
union intersection
8
genometester4 toolkit
8
analysis tasks
8
k-mer
5
analysis
5
tool currently
4
allows perform
4
currently allows
4
algebraic set
4
toolkit novel
4
developed genometester4
4
operationsfindings developed
4
set operationsfindings
4
perform algebraic
4
stand-alone k-mer
4
find unique
4

Similar Publications