Back to Search Start Over

Rivality index neighbourhood algorithm with density and distances weighted schemes for the building of robust QSAR classification models with high reliable applicability domain.

Authors :
Luque Ruiz, I.
Gómez-Nieto, M.Á.
Source :
SAR & QSAR in Environmental Research. Aug2019, Vol. 30 Issue 8, p587-615. 29p.
Publication Year :
2019

Abstract

The rivality index (RI) is a normalized distance measurement between a molecule and their first nearest neighbours providing a robust prediction of the activity of a molecule based on the known activity of their nearest neighbours. Negative values of the RI describe molecules that would be correctly classified by a statistic algorithm and, vice versa, positive values of this index describe those molecules detected as outliers by the classification algorithms. In this paper, we have described a classification algorithm based on the RI and we have proposed four weighted schemes (kernels) for its calculation based on the measuring of different characteristics of the neighbourhood of molecules for each molecule of the dataset at established values of the threshold of neighbours. The results obtained have demonstrated that the proposed classification algorithm, based on the RI, generates more reliable and robust classification models than many of the more used and well-known machine learning algorithms. These results have been validated and corroborated by using 20 balanced and unbalanced benchmark datasets of different sizes and modelability. The classification models generated provide valuable information about the molecules of the dataset, the applicability domain of the models and the reliability of the predictions. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1062936X
Volume :
30
Issue :
8
Database :
Academic Search Index
Journal :
SAR & QSAR in Environmental Research
Publication Type :
Academic Journal
Accession number :
138433688
Full Text :
https://doi.org/10.1080/1062936X.2019.1644666