Methods for enhancing the reproducibility of biomedical research findings using electronic health records.

BioData Min 2017 11;10:31. Epub 2017 Sep 11.

EHR Research Group, Department of Non-communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, Keppel Streeet, London, WC1E 7HT UK.

Background: The ability of external investigators to reproduce published scientific findings is critical for the evaluation and validation of biomedical research by the wider community. However, a substantial proportion of health research using electronic health records (EHR), data collected and generated during clinical care, is potentially not reproducible mainly due to the fact that the implementation details of most data preprocessing, cleaning, phenotyping and analysis approaches are not systematically made available or shared. With the complexity, volume and variety of electronic health record data sources made available for research steadily increasing, it is critical to ensure that scientific findings from EHR data are reproducible and replicable by researchers. Reporting guidelines, such as RECORD and STROBE, have set a solid foundation by recommending a series of items for researchers to include in their research outputs. Researchers however often lack the technical tools and methodological approaches to actuate such recommendations in an efficient and sustainable manner.

Results: In this paper, we review and propose a series of methods and tools utilized in adjunct scientific disciplines that can be used to enhance the reproducibility of research using electronic health records and enable researchers to report analytical approaches in a transparent manner. Specifically, we discuss the adoption of scientific software engineering principles and best-practices such as test-driven development, source code revision control systems, literate programming and the standardization and re-use of common data management and analytical approaches.

Conclusion: The adoption of such approaches will enable scientists to systematically document and share EHR analytical workflows and increase the reproducibility of biomedical research using such complex data sources.

Download full-text PDF

Source
http://dx.doi.org/10.1186/s13040-017-0151-7DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5594436PMC
September 2017
21 Reads

Publication Analysis

Top Keywords

electronic health
16
health records
12
reproducibility biomedical
8
ehr data
8
data sources
8
scientific findings
8
data
6
health
5
efficient sustainable
4
recommendations efficient
4
sustainable mannerresults
4
mannerresults paper
4
actuate recommendations
4
methodological approaches
4
lack technical
4
technical tools
4
tools methodological
4
paper review
4
approaches actuate
4
review propose
4

Altmetric Statistics

References

(Supplied by CrossRef)

GM Weber et al.
JAMA 2014

SC Denaxas et al.
Int J Epidemiol 2012

FS Collins et al.
N Engl J Med 2015

C Sudlow et al.
PLoS Med 2015

O Gottesman et al.
Genet Med 2013

DR Crosslin et al.
Hum Genet 2012

A Timmis et al.
Bmj 2016

AD Shah et al.
Lancet Diabetes Endocrinol 2015

E Rapsomaniki et al.
Lancet 2014

E Rapsomaniki et al.
Eur Heart J 2013

Similar Publications