IEEE/ACM Trans Comput Biol Bioinform 2019 Apr 15. Epub 2019 Apr 15.
Protein-protein interaction (PPI) network models interconnections between protein-encoding genes. Group of proteins that perform similar functions are often connected to each other in PPI network. The corresponding genes form pathways or functional modules. Mutation in protein-encoding genes affect behavior of pathways. This results in initiation, progression and severity of diseases that propagates through pathways. In this work, we integrate mutation, survival information of patients and PPI network to identify connected subnetworks associated with survival. We define computational problem using fitness function called log-rank statistic to score subnetworks. Log-rank statistic compares the survival between two populations. We propose a novel method, Survival Associated Mutated Subnetwork (SAMS) that adopts genetic algorithm strategy to find connected subnetwork within PPI network whose mutation yields highest log-rank statistic. We test on real cancer and synthetic datasets. SAMS generate solutions in negligible time while state-of-art method in literature takes exponential time. Log-rank statistic of SAMS selected mutated subnetworks are comparable to the method. Our result genesets show significant overlap with well-known cancer driver genes derived from curated datasets and studies in literature, display high text-mining score in terms of number of citations combined with disease-specific keywords in PubMed and identify pathways having high biological relevance.