Principles of metadata organization at the ENCODE data coordination center.

Database (Oxford) 2016 Mar 15;2016. Epub 2016 Mar 15.

Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org.


Source
DOI: http://dx.doi.org/10.1093/database/baw001
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792520
October 2016
