IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis.

Authors:
Fan Zhang
School of Pharmacy
Madison | United States
Renee Drabier
University of North Texas Health Science Center

BMC Bioinformatics 2012 11;13 Suppl 15:S7. Epub 2012 Sep 11.

Department of Academic and Institutional Resources and Technology, University of North Texas Health Science Center, Fort Worth, USA.

Background: Next-Generation Sequencing (NGS) technologies and Genome-Wide Association Studies (GWAS) generate millions of reads and hundreds of datasets, creating an urgent need for better ways to interpret and distill such large amounts of data accurately. Extensive pathway and network analysis allows the discovery of highly significant pathways from disease versus healthy samples in NGS and GWAS data. Knowledge of the activation of these processes will help elucidate the complex biological pathways affected by drug treatment, support patient stratification studies of new and existing drug treatments, and improve understanding of the underlying anti-cancer drug effects. According to the Pathguide database, there were approximately 141 biological human pathway resources as of January 2012. However, most currently available resources do not contain disease, drug, or organ specificity information such as disease-pathway, drug-pathway, and organ-pathway associations. Systematically integrating pathway, disease, drug, and organ specificity is increasingly crucial for understanding the interrelationships among signaling, metabolic, and regulatory pathways, drug action, disease susceptibility, and organ specificity from high-throughput omics data (genomics, transcriptomics, proteomics, and metabolomics).

Results: We designed the Integrated Pathway Analysis Database for Systematic Enrichment Analysis (IPAD, http://bioinfo.hsc.unt.edu/ipad), defining inter-association between pathway, disease, drug, and organ specificity, based on six criteria: 1) comprehensive pathway coverage; 2) gene/protein to pathway/disease/drug/organ association; 3) inter-association between pathway, disease, drug, and organ; 4) multiple and quantitative measurement of enrichment and inter-association; 5) assessment of enrichment and inter-association analysis in the context of existing biological knowledge and a "gold standard" constructed from reputable and reliable sources; and 6) cross-linking of multiple available data sources. IPAD is a comprehensive database covering about 22,498 genes, 25,469 proteins, 1956 pathways, 6704 diseases, 5615 drugs, and 52 organs integrated from databases including BioCarta, KEGG, NCI-Nature curated, Reactome, CTD, PharmGKB, DrugBank, and HOMER. The database has a web-based user interface that allows users to perform enrichment analysis from genes/proteins/molecules and inter-association analysis from a pathway, disease, drug, or organ. Moreover, the quality of the database was validated in the context of existing biological knowledge and a "gold standard" constructed from reputable and reliable sources. Two case studies were also presented to demonstrate: 1) self-validation of enrichment analysis and inter-association analysis on brain-specific markers, and 2) identification of previously undiscovered components by enrichment analysis in a prostate cancer study.
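The abstract does not say which statistic IPAD uses to score enrichment. As a rough illustration of what enrichment analysis from a gene list involves, the sketch below uses a one-sided hypergeometric test, a common choice for over-representation analysis that is assumed here rather than taken from the paper. The function names, gene identifiers, and pathway names are all hypothetical.

```python
from math import comb


def enrichment_p_value(universe_size, set_size, query_size, overlap):
    """One-sided hypergeometric tail: P(X >= overlap).

    Assumption: over-representation is scored with a hypergeometric test;
    the abstract does not state IPAD's actual statistic.
    """
    upper = min(set_size, query_size)
    total = comb(universe_size, query_size)
    tail = sum(
        comb(set_size, k) * comb(universe_size - set_size, query_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


def enrich(query_genes, annotated_sets, universe):
    """Return (set name, overlap, p-value) tuples sorted by ascending p-value."""
    universe = set(universe)
    query = set(query_genes) & universe
    results = []
    for name, members in annotated_sets.items():
        members = set(members) & universe
        overlap = len(query & members)
        p = enrichment_p_value(len(universe), len(members), len(query), overlap)
        results.append((name, overlap, p))
    return sorted(results, key=lambda row: row[2])


# Hypothetical toy data, not taken from IPAD.
universe = [f"G{i}" for i in range(1, 11)]
annotated_sets = {
    "pathway_A": ["G1", "G2", "G3"],
    "pathway_B": ["G8", "G9", "G10"],
}
ranked = enrich(["G1", "G2", "G3"], annotated_sets, universe)
for name, overlap, p in ranked:
    print(f"{name}: overlap={overlap}, p={p:.4f}")
```

The same routine could in principle be applied to disease, drug, or organ gene sets by swapping the `annotated_sets` dictionary. A real analysis would also correct the p-values for multiple testing.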

Conclusions: IPAD is a new resource for analyzing, identifying, and validating pathway, disease, drug, and organ specificity and their inter-associations. The statistical method we developed for enrichment and similarity measurement and the two criteria we described for setting the threshold parameters can be extended to other enrichment applications. Enriched pathways, diseases, drugs, organs, and their inter-associations can be searched, displayed, and downloaded from our online user interface. The current IPAD database can help users address a wide range of questions related to biological pathways, disease susceptibility, drug targets, and organ specificity in human disease studies.
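The abstract mentions a similarity measurement and threshold parameters without defining them. The sketch below shows one way an inter-association between two entities (for example, a pathway and a disease) could be scored from their shared genes. It assumes the Jaccard index as a stand-in similarity and an arbitrary cutoff. Neither is claimed to be the authors' method, and all names and data are hypothetical.

```python
def jaccard(a, b):
    """Jaccard index of two gene collections (assumed similarity measure)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def inter_associations(source_genes, candidates, threshold=0.0):
    """Score each candidate entity against the source gene set.

    Keeps candidates whose similarity exceeds `threshold` (an arbitrary,
    illustrative cutoff) and sorts them from most to least similar.
    """
    scored = [
        (name, jaccard(source_genes, genes)) for name, genes in candidates.items()
    ]
    kept = [row for row in scored if row[1] > threshold]
    return sorted(kept, key=lambda row: row[1], reverse=True)


# Hypothetical toy data, not taken from IPAD.
pathway_genes = ["G1", "G2", "G3", "G4"]
diseases = {
    "disease_X": ["G1", "G2", "G3"],
    "disease_Y": ["G4", "G9"],
    "disease_Z": ["G7"],
}
hits = inter_associations(pathway_genes, diseases, threshold=0.1)
for name, score in hits:
    print(f"{name}: similarity={score:.2f}")
```

In this toy case `disease_Z` shares no genes with the pathway and is filtered out by the cutoff, while the other two diseases are ranked by their overlap.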

Source
DOI: http://dx.doi.org/10.1186/1471-2105-13-S15-S7
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439721
April 2013

Similar Publications

HOMER: a human organ-specific molecular electronic repository.

BMC Bioinformatics 2011 Oct 18;12 Suppl 10:S4. Epub 2011 Oct 18.

School of Informatics, Indiana University, Indianapolis, IN 46202, USA.

Background: Each organ has a specific function in the body. "Organ-specificity" refers to differential expressions of the same gene across different organs. An organ-specific gene/protein is defined as a gene/protein whose expression is significantly elevated in a specific human organ.

October 2011

HPD: an online integrated human pathway database enabling systems biology studies.

BMC Bioinformatics 2009 Oct 8;10 Suppl 11:S5. Epub 2009 Oct 8.

Indiana University School of Informatics, Indianapolis, IN 46202, USA.

Background: Pathway-oriented experimental and computational studies have led to a significant accumulation of biological knowledge concerning three major types of biological pathway events: molecular signaling events, gene regulation events, and metabolic reaction events. A pathway consists of a series of molecular pathway events that link molecular entities such as proteins, genes, and metabolites. There are approximately 300 biological pathway resources as of April 2009 according to the Pathguide database; however, these pathway databases generally have poor coverage or poor quality, and are difficult to integrate, due to syntactic-level and semantic-level data incompatibilities.

October 2009

3Omics: a web-based systems biology tool for analysis, integration and visualization of human transcriptomic, proteomic and metabolomic data.

BMC Syst Biol 2013 Jul 23;7:64. Epub 2013 Jul 23.

Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan.

Background: Integrative and comparative analyses of multiple transcriptomics, proteomics and metabolomics datasets require an intensive knowledge of tools and background concepts. Thus, it is challenging for users to perform such analyses, highlighting the need for a single tool for such purposes. The 3Omics one-click web tool was developed to visualize and rapidly integrate multiple human inter- or intra-transcriptomic, proteomic, and metabolomic data by combining five commonly used analyses: correlation networking, coexpression, phenotyping, pathway enrichment, and GO (Gene Ontology) enrichment.

July 2013

SNP-based pathway enrichment analysis for genome-wide association studies.

BMC Bioinformatics 2011 Apr 15;12:99. Epub 2011 Apr 15.

Department of Computer Science, University of California, Irvine, USA.

Background: Recently we have witnessed a surge of interest in using genome-wide association studies (GWAS) to discover the genetic basis of complex diseases. Many genetic variations, mostly in the form of single nucleotide polymorphisms (SNPs), have been identified in a wide spectrum of diseases, including diabetes, cancer, and psychiatric diseases. A common theme arising from these studies is that the genetic variations discovered by GWAS can only explain a small fraction of the genetic risks associated with the complex diseases.

April 2011