1. مدل بندی و داده کاوی داده های جهانی بیماران ویروس کووید ۱.
- Author
-
مصطفی بسکابادی and مهدی دوست پرست
- Subjects
- *
CART algorithms , *COVID-19 , *RHINORRHEA , *RECEIVER operating characteristic curves , *DECISION trees , *COUGH , *SNEEZING - Abstract
Introduction: Data mining techniques, including decision tree algorithms can be used for modeling and identifying those at risk of developing COVID-19. The main goal of this study is estimating the risk of death of patients due to COVID-19, using the classification and regression tree (CART) algorithm based on the observed effective factors. Methods: This paper is an analytical study and the data of all patients with COVID-19 registered on the Kaggle site through Johns Hopkins University, was extracted. There was a total of 26,031 records from various countries. Data analysis was performed using JMP statistical software version 13. In the modeling section, decision tree algorithm and CART model were used. Results: The results of the classification and regression tree showed that among quantitative variables, age, the interval between hospitalization and result, the interval between onset of symptoms and test result, and the interval between hospitalization and test result, and the qualitative variable of gender were the most important factors affecting the outcome of patient treatment, respectively. According to the analysis of words, fever, cough, sore throat, fatigue, weakness, headache, chills, and runny nose, respectively, were the most common symptoms among patients with this disease. Conclusion: The accuracy of the fitted model was shown to be 94.1% for experimental data and 91.1% for educational data using the area under the receiver operating characteristic (ROC) curve. [ABSTRACT FROM AUTHOR]
- Published
- 2020