Back to Search Start Over

Development, study, and comparison of models of cross-immunity to the influenza virus using statistical methods and machine learning

Development, study, and comparison of models of cross-immunity to the influenza virus using statistical methods and machine learning

Authors :
Marina N. Asatryan
Ilya S. Shmyr
Boris I. Timofeev
Dmitrii N. Shcherbinin
Vaagn G. Agasaryan
Tatiana A. Timofeeva
Ivan F. Ershov
Elita R. Gerasimuk
Anna V. Nozdracheva
Tatyana A. Semenenko
Denis Yu. Logunov
Aleksander L. Gintsburg
Source :
Вопросы вирусологии, Vol 69, Iss 4, Pp 349-362 (2024)
Publication Year :
2024
Publisher :
Central Research Institute for Epidemiology, 2024.

Abstract

Introduction. The World Health Organization considers the values of antibody titers in the hemagglutination inhibition assay as one of the most important criteria for assessing successful vaccination. Mathematical modeling of cross-immunity allows for identification on a real-time basis of new antigenic variants, which is of paramount importance for human health. Materials and methods. This study uses statistical methods and machine learning techniques from simple to complex: logistic regression model, random forest method, and gradient boosting. The calculations used the AAindex matrices in parallel to the Hamming distance. The calculations were carried out with different types and values of antigenic escape thresholds, on four data sets. The results were compared using common binary classification metrics. Results. Significant differentiation is shown depending on the data sets used. The best results were demonstrated by all three models for the forecast autumn season of 2022, which were preliminary trained on the February season of the same year (Auroc 0.934; 0.958; 0.956, respectively). The lowest results were obtained for the entire forecast year 2023, they were set up on data from two seasons of 2022 (Aucroc 0.614; 0.658; 0.775). The dependence of the results on the types of thresholds used and their values turned out to be insignificant. The additional use of AAindex matrices did not significantly improve the results of the models without introducing significant deterioration. Conclusion. More complex models show better results. When developing cross-immunity models, testing on a variety of data sets is important to make strong claims about their prognostic robustness.

Details

Language :
English, Russian
ISSN :
05074088 and 24112097
Volume :
69
Issue :
4
Database :
Directory of Open Access Journals
Journal :
Вопросы вирусологии
Publication Type :
Academic Journal
Accession number :
edsdoj.74daf3ff47a14a918fe07fefb7c30524
Document Type :
article
Full Text :
https://doi.org/10.36233/0507-4088-250