An R package that automatically collects and archives details for reproducible computing.

BMC Bioinformatics 2014 May 10;15:138. Epub 2014 May 10.

Department of Biostatistics, St Jude Children's Research Hospital, Memphis TN 38105, USA.

Background: It is scientifically and ethically imperative that the results of statistical analysis of biomedical research data be computationally reproducible in the sense that the reported results can be easily recapitulated from the study data. Some statistical analyses are computationally a function of many data files, program files, and other details that are updated or corrected over time. In many applications, it is infeasible to manually maintain an accurate and complete record of all these details about a particular analysis.

Results: Therefore, we developed the rctrack package that automatically collects and archives read only copies of program files, data files, and other details needed to computationally reproduce an analysis.

Conclusions: The rctrack package uses the trace function to temporarily embed detail collection procedures into functions that read files, write files, or generate random numbers so that no special modifications of the primary R program are necessary. At the conclusion of the analysis, rctrack uses these details to automatically generate a read only archive of data files, program files, result files, and other details needed to recapitulate the analysis results. Information about this archive may be included as an appendix of a report generated by Sweave or knitR. Here, we describe the usage, implementation, and other features of the rctrack package. The rctrack package is freely available from http://www.stjuderesearch.org/site/depts/biostats/rctrack under the GPL license.

Download full-text PDF

Source
http://bmcbioinformatics.biomedcentral.com/articles/10.1186/
Publisher Site
http://dx.doi.org/10.1186/1471-2105-15-138DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4026591PMC
May 2014
31 Reads

Publication Analysis

Top Keywords

rctrack package
16
data files
12
files details
12
program files
12
files
9
files program
8
details needed
8
collects archives
8
package automatically
8
automatically collects
8
details
6
package
5
rctrack
5
data
5
collection procedures
4
detail collection
4
read files
4
files generate
4
generate random
4
write files
4

References

(Supplied by CrossRef)

ScienceExchange et al.
2012

CM Micheel et al.
2012

LM McShane et al.
Nature 2013

LM McShane et al.
BMC Med 2013

NationalScienceBoard et al.
2011

KA Baggerly et al.
Ann Appl Stat 2009

P Goldberg et al.
Cancer Lett 2010

P Goldberg et al.
Cancer Lett 2010

P Goldberg et al.
2013

R Gentleman et al.
J Comput Graphical Stat 2007

F Leisch et al.
2002

Similar Publications