Mosquera, Rodolfo, Castrillón, Omar D., and Parra, Liliana
Abstract
This paper presents a new methodology that applies Support Vector Machines (SVM), Naïve Bayes, and genetic algorithms to psychosocial evaluation diagnostics in order to identify and predict the psychosocial risk level of public-school teachers in Colombia. A comparative study of two machine learning models for prediction, SVM and Naïve Bayes, was carried out in two stages: first with all the variables, and second after reducing the dimensionality of the database by applying genetic algorithms. The forty variables yielding the best prediction accuracy were selected. The database consisted of 3000 epidemiological records corresponding to teachers from public schools in the metropolitan area of a Colombian city. SVM readily detected physiological variables and achieved the best prediction performance, with an accuracy of 96.3%. [ABSTRACT FROM AUTHOR]
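The two-stage idea in this abstract (score a classifier on all variables, then let a genetic algorithm search for a smaller subset of variables) can be illustrated with a minimal sketch. This is not the authors' implementation: the data are synthetic, a simple nearest-centroid classifier stands in for SVM and Naïve Bayes so that the example needs only the Python standard library, and every name and parameter (`make_data`, `centroid_accuracy`, `select_features`, the penalty of 0.005 per kept feature) is an assumption made for illustration.

```python
import random

def make_data(n_rows=200, n_features=12, n_informative=3, seed=0):
    """Synthetic stand-in data: only the first n_informative columns carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n_rows):
        label = rng.randint(0, 1)
        row = [rng.gauss(2.0 * label if j < n_informative else 0.0, 1.0)
               for j in range(n_features)]
        X.append(row)
        y.append(label)
    return X, y

def centroid_accuracy(X_train, y_train, X_test, y_test, mask):
    """Accuracy of a nearest-centroid classifier using only the masked columns."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0
    centroids = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X_train, y_train) if t == label]
        centroids[label] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    correct = 0
    for x, t in zip(X_test, y_test):
        dist = {label: sum((x[j] - c) ** 2 for j, c in zip(cols, cen))
                for label, cen in centroids.items()}
        correct += min(dist, key=dist.get) == t
    return correct / len(y_test)

def select_features(X, y, n_generations=15, pop_size=20, mutation_rate=0.1, seed=1):
    """Genetic search over 0/1 feature masks (hypothetical settings)."""
    rng = random.Random(seed)
    n_features = len(X[0])
    half = len(X) // 2
    X_tr, y_tr, X_te, y_te = X[:half], y[:half], X[half:], y[half:]

    def fitness(mask):
        # Accuracy minus a small penalty per kept feature, to favour compact subsets.
        return centroid_accuracy(X_tr, y_tr, X_te, y_te, mask) - 0.005 * sum(mask)

    population = [[rng.randint(0, 1) for _ in range(n_features)]
                  for _ in range(pop_size)]
    for _ in range(n_generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]   # truncation selection, parents survive
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)  # one-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation applied independently to each gene.
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            children.append(child)
        population = parents + children
    best = max(population, key=fitness)
    return best, centroid_accuracy(X_tr, y_tr, X_te, y_te, best)

X, y = make_data()
mask, acc = select_features(X, y)
print("kept columns:", [j for j, keep in enumerate(mask) if keep])
print("hold-out accuracy:", round(acc, 3))
```

Because the same hold-out split is used both to guide the search and to report accuracy, the printed figure is optimistic; a real study would score the final subset on data the search never saw.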
Mosquera, Rodolfo, Castrillón, Omar D., and Parra, Liliana
Abstract
This paper presents a new methodology based on machine learning techniques for diagnostics of psychosocial assessments, used to identify the risk level of public-school teachers in Colombia. A comparative study of three important machine learning models for prediction was carried out: artificial neural networks, decision trees, and Naïve Bayes, with the dimensionality of the data reduced. The reduction was done by applying genetic algorithms, the expected amount of information algorithm, the GainRatioAttributeEval algorithm, Pearson's correlation coefficient, and principal components analysis. The database contained 5340 epidemiological records corresponding to psychosocial evaluations of teachers from public schools in the metropolitan area of a Colombian city. The best predictive performance was obtained with the artificial neural network model, with an accuracy of 93%. [ABSTRACT FROM AUTHOR]
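Of the dimensionality-reduction techniques listed in this abstract, ranking variables by Pearson's coefficient is the simplest to show concretely. The sketch below is an illustration under stated assumptions, not the procedure used in the paper: the data are synthetic, and the helper names `pearson` and `rank_features` and the choice of `top_k=2` are hypothetical.

```python
import math
import random

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    sx = math.sqrt(sum((a - mx) ** 2 for a in xs))
    sy = math.sqrt(sum((b - my) ** 2 for b in ys))
    # A constant column has no defined correlation; treat it as uninformative.
    return cov / (sx * sy) if sx and sy else 0.0

def rank_features(X, y, top_k):
    """Return the indices of the top_k columns by |correlation with y|, plus all scores."""
    n_features = len(X[0])
    scores = [abs(pearson([row[j] for row in X], y)) for j in range(n_features)]
    order = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return order[:top_k], scores

# Synthetic stand-in data: column 0 tracks the label closely, column 1 weakly,
# and columns 2 and 3 are pure noise.
rng = random.Random(0)
y = [rng.randint(0, 1) for _ in range(300)]
X = [[t + rng.gauss(0, 0.5), t + rng.gauss(0, 2.0), rng.gauss(0, 1), rng.gauss(0, 1)]
     for t in y]

top, scores = rank_features(X, y, top_k=2)
print("selected columns:", top)
print("absolute correlations:", [round(s, 3) for s in scores])
```

A filter of this kind scores each variable on its own, so it is cheap but cannot detect variables that are only useful in combination; that is one reason to compare it against wrapper methods such as genetic algorithms.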