1. QSPR Models for Prediction of Aqueous Solubility: Exploring the Potency of Randić-type Indices
- Author
-
Sluga, Janja; Venko, Katja; Drgan, Viktor; Novič, Marjana
- Abstract
The development of QSPR models to predict aqueous solubility (logS) is presented. A structurally diverse set of over 1600 compounds with experimentally determined solubility values (AqSolDB database) is used to build data-driven models based on multiple linear regression (MLR) and artificial neural network (ANN) methods. Molecular structures are encoded by numerous structural descriptors, including the connectivity index developed by Randić in 1975 and many variations derived from it later. To evaluate the potency of Randić-like descriptors in the structure-property relationship, we developed models based on two sets of descriptors: the first using only Randić-like descriptors calculated with Dragon, and the second using 17 commonly applied descriptors available in the AqSolDB database. All models were validated with external prediction sets, with the RMSE ranging from 0.8 to 1.1. Interestingly, the RMSE of the predicted logS values for models based only on the Randić-like descriptors was on average just 0.1 larger than that of the models with 17 descriptors preselected as suitable for modelling logS. This work is licensed under a Creative Commons Attribution 4.0 International License.
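For orientation, the original Randić connectivity index mentioned in the abstract sums (d_i · d_j)^(-1/2) over all bonds of the hydrogen-suppressed molecular graph, where d_i is the number of heavy-atom neighbours of atom i. The following is a minimal sketch of that definition only; it is not the Dragon implementation used by the authors, and the function name `randic_index` and the edge-list input format are assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def randic_index(edges):
    """Randić connectivity index of a hydrogen-suppressed molecular graph.

    edges: iterable of (i, j) atom-label pairs, one pair per bond
    (hypothetical input format chosen for this sketch).
    """
    edges = list(edges)
    # Vertex degree = number of bonds each heavy atom takes part in.
    degree = Counter()
    for i, j in edges:
        degree[i] += 1
        degree[j] += 1
    # Sum the bond contributions 1 / sqrt(d_i * d_j).
    return sum(1.0 / sqrt(degree[i] * degree[j]) for i, j in edges)


# Carbon skeletons of two C4H10 isomers.
n_butane = [(0, 1), (1, 2), (2, 3)]    # linear chain
isobutane = [(0, 1), (0, 2), (0, 3)]   # central carbon with three neighbours

print(round(randic_index(n_butane), 4))   # 1.9142
print(round(randic_index(isobutane), 4))  # 1.7321
```

The more branched isomer gets the lower value, which is the branching sensitivity that makes this family of descriptors a candidate for structure-property models.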
- Published
- 2020