Genome Modeling System: A Knowledge Management Platform for Genomics.

PLoS Comput Biol 2015 Jul 9;11(7):e1004274. Epub 2015 Jul 9.

The Genome Institute, Washington University in St. Louis, St. Louis, Missouri, United States of America; Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, United States of America; Department of Medicine, Washington University School of Medicine, St. Louis, Missouri, United States of America; Siteman Cancer Center, Washington University School of Medicine, St. Louis, Missouri, United States of America; Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, United States of America.

In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms.

Download full-text PDF

Source
http://dx.doi.org/10.1371/journal.pcbi.1004274DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4497734PMC
July 2015
105 Reads

Publication Analysis

Top Keywords

management system
8
modeling system
8
analysis pipelines
8
genome modeling
8
pipelines gms
8
gms
7
analysis
6
data
5
analysis rigorous
4
ad-hoc analysis
4
separating ad-hoc
4
allowing large
4
rigorous reproducible
4
gms promotes
4
development allowing
4
demonstration gms
4
integration demonstration
4
systematic integration
4
promotes systematic
4
reproducible pipelines
4

Similar Publications