Large-scale classification of P-glycoprotein inhibitors using SMILES-based descriptors

Abstract

P-glycoprotein (Pgp) inhibition has been considered as an effective strategy towards combating multidrug-resistant cancers. Owing to the substrate promiscuity of Pgp, the classification of its interacting ligands is not an easy task and is an ongoing issue of debate. Chemical structures can be represented by the simplified molecular input line entry system (SMILES) in the form of linear string of symbols. In this study, the SMILES notations of 2254 Pgp inhibitors including 1341 active, and 913 inactive compounds were used for the construction of a SMILE-based classification model using CORrelation And Logic (CORAL) software. The model provided an acceptable predictive performance as observed from statistical parameters consisting of accuracy, sensitivity and specificity that afforded values greater than 70% and MCC value greater than 0.6 for training, calibration and validation sets. In addition, the CORAL method highlighted chemical features that may contribute to increased and decreased Pgp inhibitory activities. This study highlights the potential of CORAL software for rapid screening of prospective compounds from a large chemical space and provides information that could aid in the design and development of potential Pgp inhibitors.

Publication
SAR and QSAR in Environmental Research
Date
Citation
Prachayasittikul V, Worachartcheewan A, Toropova AP, Toropov AA, Prachayasittikul V, Nantasenamat C. Large-scale classification of P-glycoprotein inhibitors using SMILES-based descriptors. SAR and QSAR in Environmental Research 28 (2017) 1-16.