Sunbeam: an extensible pipeline for analyzing metagenomic sequencing experiments.

Microbiome. 2019 Mar 22;7(1):46. Epub 2019 Mar 22.

Division of Gastroenterology, Hepatology and Nutrition, The Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA.

Background: Analysis of mixed microbial communities using metagenomic sequencing experiments requires multiple preprocessing and analytical steps to interpret the microbial and genetic composition of samples. Analytical steps include quality control, adapter trimming, host decontamination, metagenomic classification, read assembly, and alignment to reference genomes.
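As an illustration of the first two steps listed above (quality control and adapter trimming), the following is a minimal sketch, not Sunbeam's own implementation. The `Read` type, the function names, the exact-match adapter search, and the thresholds are all hypothetical choices made for this example; qualities are assumed to be Phred+33 encoded.

```python
from typing import List, NamedTuple


class Read(NamedTuple):
    """Hypothetical in-memory read record (not a Sunbeam type)."""
    name: str
    seq: str
    qual: str  # quality string, assumed Phred+33 encoded


def mean_quality(read: Read) -> float:
    """Mean Phred score of a read, assuming Phred+33 encoding."""
    if not read.qual:
        return 0.0
    return sum(ord(c) - 33 for c in read.qual) / len(read.qual)


def trim_adapter(read: Read, adapter: str) -> Read:
    """Cut the read at the first exact occurrence of the adapter."""
    idx = read.seq.find(adapter)
    if idx == -1:
        return read  # adapter not present, read is unchanged
    return Read(read.name, read.seq[:idx], read.qual[:idx])


def preprocess(reads: List[Read], adapter: str,
               min_mean_q: float = 20.0, min_len: int = 10) -> List[Read]:
    """Trim adapters, then drop reads that are too short or too low in quality.

    The thresholds are illustrative defaults, not values from the paper.
    """
    kept = []
    for read in reads:
        read = trim_adapter(read, adapter)
        if len(read.seq) >= min_len and mean_quality(read) >= min_mean_q:
            kept.append(read)
    return kept
```

Real trimming tools also tolerate mismatches and partial adapter matches at the read end; the exact-match search here only shows where such a step sits in the order of operations.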

Results: We present a modular and user-extensible pipeline called Sunbeam that performs these steps in a consistent and reproducible fashion. It can be installed in a single step, does not require administrative access to the host computer system, and can work with most cluster computing frameworks. We also introduce Komplexity, a software tool to eliminate potentially problematic, low-complexity nucleotide sequences from metagenomic data. A unique component of the Sunbeam pipeline is an easy-to-use extension framework that enables users to add custom processing or analysis steps directly to the workflow. The pipeline and its extension framework are well documented, in routine use, and regularly updated.
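The abstract does not say how Komplexity scores a sequence, so the following is only a sketch of one plausible low-complexity filter: it scores each sequence by the number of distinct k-mers divided by the sequence length and discards sequences below a cutoff. The function names, `k = 4`, and the `0.55` threshold are assumptions for illustration and should not be read as Komplexity's interface.

```python
from typing import Iterable, List


def kmer_complexity(seq: str, k: int = 4) -> float:
    """Distinct k-mers divided by sequence length (an assumed scoring scheme)."""
    seq = seq.upper()
    n_kmers = len(seq) - k + 1
    if n_kmers <= 0:
        return 0.0  # sequence shorter than k: treat as minimal complexity
    distinct = {seq[i:i + k] for i in range(n_kmers)}
    return len(distinct) / len(seq)


def filter_low_complexity(seqs: Iterable[str],
                          threshold: float = 0.55,
                          k: int = 4) -> List[str]:
    """Keep only sequences whose complexity score meets the threshold."""
    return [s for s in seqs if kmer_complexity(s, k) >= threshold]
```

Under this scheme a homopolymer such as forty consecutive `A` bases contains a single distinct 4-mer and scores 1/40, so it is removed, while a read with few repeated k-mers scores close to 1 and is kept.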

Conclusions: Sunbeam provides a foundation to build more in-depth analyses and to enable comparisons in metagenomic sequencing experiments by removing problematic, low-complexity reads and standardizing post-processing and analytical steps. Sunbeam is written in Python using the Snakemake workflow management software and is freely available at github.com/sunbeam-labs/sunbeam under the GPLv3.

Source
DOI: http://dx.doi.org/10.1186/s40168-019-0658-x
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6429786
March 2019
