Classification of toxicity effects of biotransformed hepatic drugs using whale optimized support vector machines.

Authors:
Alaa Tharwat
Suez Canal University
Yasmine S Moemen
National Liver Institute
Dallas | United States
Aboul Ella Hassanien
Cairo University
Egypt

J Biomed Inform 2017 Apr 8;68:132-149. Epub 2017 Mar 8.

Scientific Research Group in Egypt (SRGE), Egypt; Faculty of Computers and Information, Cairo University, Egypt.

Measuring toxicity is an important step in drug development. However, the experimental methods currently used to estimate drug toxicity are expensive and time-consuming, which makes them unsuitable for large-scale toxicity evaluation in the early stages of drug development. Hence, there is a high demand for computational models that can predict drug toxicity risks. In this study, we used a dataset of 553 drugs that are biotransformed in the liver. Four toxic effects were calculated for these data: mutagenic, tumorigenic, irritant and reproductive effects. Each drug is represented by 31 chemical descriptors (features). The proposed model consists of three phases. In the first phase, the most discriminative subset of features is selected using rough set-based methods to reduce the classification time while improving the classification performance. In the second phase, different sampling methods such as Random Under-Sampling, Random Over-Sampling, Synthetic Minority Oversampling Technique (SMOTE), BorderLine SMOTE and Safe Level SMOTE are used to address the class imbalance in the dataset. In the third phase, a Support Vector Machine (SVM) classifier is used to classify an unknown drug as toxic or non-toxic. SVM parameters such as the penalty parameter and the kernel parameter have a great impact on the classification accuracy of the model. In this paper, the Whale Optimization Algorithm (WOA) is proposed to optimize the SVM parameters so that the classification error is reduced. The experimental results showed that the proposed model achieved high sensitivity for all toxic effects. Overall, the high sensitivity of the WOA+SVM model indicates that it could be used to predict drug toxicity in the early stages of drug development.
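
The parameter search in the third phase can be illustrated with a minimal sketch of one common formulation of WOA (encircling, random exploration and spiral updates). This is not the authors' implementation. The names woa_minimize, surrogate_error and SEARCH_BOUNDS, the population size, the iteration count and the search ranges are all assumptions made for illustration. In the study the objective would be the SVM classification error for a given penalty and kernel parameter; here a hypothetical smooth surrogate with a made-up minimum stands in so that the block runs with the Python standard library alone.

```python
import math
import random


def woa_minimize(objective, bounds, n_whales=20, n_iter=100, seed=0):
    """Minimize objective(position) inside box bounds with a basic WOA sketch."""
    rng = random.Random(seed)
    dim = len(bounds)
    b = 1.0  # shape constant of the logarithmic spiral

    def clip(pos):
        # Keep every coordinate inside its [lo, hi] range.
        return [min(max(x, lo), hi) for x, (lo, hi) in zip(pos, bounds)]

    whales = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_whales)]
    scores = [objective(w) for w in whales]
    best_idx = min(range(n_whales), key=scores.__getitem__)
    best_pos, best_score = list(whales[best_idx]), scores[best_idx]

    for t in range(n_iter):
        a = 2.0 * (1.0 - t / n_iter)  # decreases linearly from 2 towards 0
        for i in range(n_whales):
            x = whales[i]
            p = rng.random()            # chooses between the two update rules
            l = rng.uniform(-1.0, 1.0)  # position along the spiral
            partner = whales[rng.randrange(n_whales)]
            new = []
            for d in range(dim):
                A = 2.0 * a * rng.random() - a
                C = 2.0 * rng.random()
                if p < 0.5:
                    # |A| < 1: move around the best whale (exploitation);
                    # otherwise move around a random whale (exploration).
                    ref = best_pos[d] if abs(A) < 1.0 else partner[d]
                    new.append(ref - A * abs(C * ref - x[d]))
                else:
                    # Spiral update around the best whale.
                    dist = abs(best_pos[d] - x[d])
                    new.append(dist * math.exp(b * l) * math.cos(2.0 * math.pi * l)
                               + best_pos[d])
            whales[i] = clip(new)
            score = objective(whales[i])
            if score < best_score:  # greedy update of the best position so far
                best_pos, best_score = list(whales[i]), score
    return best_pos, best_score


def surrogate_error(params):
    """Hypothetical stand-in for SVM error over (log2 C, log2 gamma).

    The location of the minimum, (3, -5), is made up for illustration.
    """
    log2_c, log2_gamma = params
    return (log2_c - 3.0) ** 2 + (log2_gamma + 5.0) ** 2


# Assumed search ranges for log2 C and log2 gamma.
SEARCH_BOUNDS = [(-5.0, 15.0), (-15.0, 3.0)]

best_params, best_err = woa_minimize(surrogate_error, SEARCH_BOUNDS)
print("log2 C, log2 gamma:", best_params, "surrogate error:", best_err)
```

To use the sketch with a real classifier, surrogate_error would be replaced by a function that trains an SVM with the candidate parameters and returns its cross-validated error on the balanced training data. Searching in log2 space is a design choice assumed here, not something the abstract specifies.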

Source
http://dx.doi.org/10.1016/j.jbi.2017.03.002
April 2017

Publication Analysis

Top Keywords

drug toxicity: 16
drug development: 12
drug: 9
support vector: 8
proposed model: 8
early stage: 8
vector machines: 8
stage drug: 8
toxicity early: 8
toxic effects: 8
high sensitivity: 8
classification: 5
toxicity: 5
selected rough: 4
subset features: 4
features selected: 4
whale optimization: 4
paper whale: 4
classification time: 4
time improving: 4

Similar Publications

A Predictive Model for Toxicity Effects Assessment of Biotransformed Hepatic Drugs Using Iterative Sampling Method.

Sci Rep 2016 Dec 9;6:38660. Epub 2016 Dec 9.

Scientific Research Group in Egypt, (SRGE), Cairo, Egypt.

Measuring toxicity is one of the main steps in drug development. Hence, there is a high demand for computational models to predict the toxicity effects of potential drugs. In this study, we used a dataset that covers four toxicity effects: mutagenic, tumorigenic, irritant and reproductive effects.

December 2016

Classification of Imbalanced Data by Oversampling in Kernel Space of Support Vector Machines.

IEEE Trans Neural Netw Learn Syst 2017 Oct 10. Epub 2017 Oct 10.

Historical data sets for fault stage diagnosis in industrial machines are often imbalanced and consist of multiple categories or classes. Learning discriminative models from such data sets is challenging due to the lack of representative data and the bias of traditional classifiers toward the majority class. Sampling methods like synthetic minority oversampling technique (SMOTE) have been traditionally used for such problems to artificially balance the data set before being trained by a classifier.

October 2017

CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests.

Authors:
Li Ma, Suohai Fan

BMC Bioinformatics 2017 Mar 14;18(1):169. Epub 2017 Mar 14.

School of Information Science and Technology, Jinan University, Guangzhou, 510632, China.

Background: The random forests algorithm is a type of classifier with prominent universality, a wide application range, and robustness for avoiding overfitting. But there are still some drawbacks to random forests. Therefore, to improve the performance of random forests, this paper seeks to improve imbalanced data processing, feature selection and parameter optimization.

March 2017

Diagnosis of Brain Metastases from Lung Cancer Using a Modified Electromagnetism like Mechanism Algorithm.

J Med Syst 2016 Jan 14;40(1):35. Epub 2015 Nov 14.

School of Dentistry, College of Oral Medicine, Taipei Medical University, Taipei 110, Taiwan, Republic of China.

Brain metastases are commonly found in patients diagnosed with a primary malignancy of the lung. Lung cancer patients with brain metastasis tend to have poor survival, with a median of less than 6 months. Therefore, an early and effective detection system for this disease is needed to help prolong the patients' survival and improve their quality of life.

January 2016