    isGPT: An optimized model to identify sub-Golgi protein types using SVM and Random Forest based feature selection.

    Artif Intell Med 2018 01 26;84:90-100. Epub 2017 Nov 26.
    Department of CSE, BUET, ECE Building, West Palasi, Dhaka 1205, Bangladesh. Electronic address:
    The Golgi Apparatus (GA) is a key organelle for protein synthesis within the eukaryotic cell. The main task of the GA is to modify and sort proteins for transport throughout the cell. Proteins enter the GA on the side facing the ER (Endoplasmic Reticulum), the cis side, and depart on the other side, the trans side. Based on this phenomenon, there are two types of GA proteins, namely cis-Golgi proteins and trans-Golgi proteins. Any dysfunction of GA proteins can result in congenital glycosylation disorders and other difficulties that may lead to neurodegenerative and inherited diseases such as diabetes, cancer and cystic fibrosis. Accurate classification of GA proteins may therefore contribute to drug development and, in turn, to treatment. In this paper, we focus on building a new computational model that not only introduces easy ways to extract features from protein sequences but also optimizes classification of trans-Golgi and cis-Golgi proteins. After feature extraction, we employ a Random Forest (RF) model to rank the features by the importance scores it produces. After selecting the top-ranked features, we apply a Support Vector Machine (SVM) to classify the sub-Golgi proteins. We trained a regression model as well as a classification model and found the former to be superior. The model shows improved performance over all previous methods. As the benchmark dataset is significantly imbalanced, we applied the Synthetic Minority Over-sampling Technique (SMOTE) to balance the dataset and conducted experiments on both versions. Our method, identification of sub-Golgi Protein Types (isGPT), achieves accuracy values of 95.4%, 95.9% and 95.3% for the 10-fold cross-validation test, the jackknife test and the independent test, respectively. According to different performance metrics, isGPT performs better than state-of-the-art techniques.
The source code of isGPT, along with relevant dataset and detailed experimental results, can be found at
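
As a rough illustration of two steps named in the abstract, the sketch below shows SMOTE-style oversampling of a minority class followed by a jackknife (leave-one-out) accuracy check. It is not the authors' implementation: the function names `smote_oversample` and `jackknife_accuracy`, the toy 2-D points and the 1-nearest-neighbour stand-in classifier (used here in place of the SVM) are all assumptions made for the example.

```python
import math
import random


def smote_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (the SMOTE idea).
    Assumes the minority class has at least two samples."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point, excluding itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append(tuple(b + gap * (o - b) for b, o in zip(base, other)))
    return synthetic


def jackknife_accuracy(samples, labels):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier
    (a hypothetical stand-in for the SVM described in the abstract)."""
    correct = 0
    for i, (x, y) in enumerate(zip(samples, labels)):
        # Hold out sample i and predict it from all remaining samples
        rest = [(s, l) for j, (s, l) in enumerate(zip(samples, labels)) if j != i]
        nearest = min(rest, key=lambda pair: math.dist(x, pair[0]))
        correct += nearest[1] == y
    return correct / len(samples)


# Toy, invented data: label 0 is the majority class, label 1 the minority class.
majority = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.3, 0.3), (0.2, 0.4), (0.4, 0.2)]
minority = [(2.0, 2.0), (2.2, 2.1), (2.1, 2.3)]

# Generate enough synthetic minority points to match the majority class size.
synthetic = smote_oversample(minority, n_new=len(majority) - len(minority), k=2)

samples = majority + minority + synthetic
labels = [0] * len(majority) + [1] * (len(minority) + len(synthetic))
balanced_acc = jackknife_accuracy(samples, labels)
print(len(samples), balanced_acc)
```

Each synthetic point lies on a line segment between two real minority samples, so oversampling adds no information from outside the observed minority region. In a real evaluation the synthetic samples would normally be generated from the training portion only, so that held-out samples are never interpolated into the training set.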

    Similar Publications

    A Novel Feature Extraction Method with Feature Selection to Identify Golgi-Resident Protein Types from Imbalanced Data.
    Int J Mol Sci 2016 Feb 6;17(2):218. Epub 2016 Feb 6.
    School of Control Science and Engineering, Shandong University, Jinan 250061, China.
    The Golgi Apparatus (GA) is a major collection and dispatch station for numerous proteins destined for secretion, plasma membranes and lysosomes. The dysfunction of GA proteins can result in neurodegenerative diseases. Therefore, accurate identification of protein subGolgi localizations may assist in drug development and understanding the mechanisms of the GA involved in various cellular processes.
    DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC.
    J Theor Biol 2018 May 16;452:22-34. Epub 2018 May 16.
    Department of CSE, BUET, ECE Building, West Palasi, Dhaka 1205, Bangladesh. Electronic address:
    A DNA-binding protein (DNA-BP) is a protein that can bind and interact with DNA. Identification of DNA-BPs using experimental methods is expensive as well as time-consuming. As such, fast and accurate computational methods are sought for predicting whether a protein can bind with DNA or not.
    Intelligent computational model for classification of sub-Golgi protein using oversampling and fisher feature selection methods.
    Artif Intell Med 2017 05 10;78:14-22. Epub 2017 May 10.
    Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan. Electronic address:
    The Golgi is one of the core organelles of a cell, present in both plants and animals, and is involved in protein synthesis. It is responsible for receiving and processing macromolecules and for trafficking newly processed proteins to their intended destinations. Dysfunction of Golgi proteins is expected to cause many neurodegenerative and inherited diseases, which may be treated more successfully if they are detected effectively and in time.
    Classification of toxicity effects of biotransformed hepatic drugs using whale optimized support vector machines.
    J Biomed Inform 2017 04 8;68:132-149. Epub 2017 Mar 8.
    Scientific Research Group in Egypt (SRGE), Egypt; Faculty of Computers and Information, Cairo University, Egypt. Electronic address:
    Measuring toxicity is an important step in drug development. Nevertheless, the current experimental methods used to estimate drug toxicity are expensive and time-consuming, indicating that they are not suitable for large-scale evaluation of drug toxicity in the early stage of drug development. Hence, there is a high demand to develop computational models that can predict drug toxicity risks.