1. Algoritmo híbrido basado en aprendizaje computacional para el manejo de datos faltantes en aplicaciones OLAP.
- Author
-
Hernández García, Claudia Liliana and Rodríguez Rodríguez, Jorge Enrique
- Subjects
- *
MACHINE learning , *MISSING data (Statistics) , *ALGORITHMS , *K-nearest neighbor classification , *ARTIFICIAL neural networks , *ELECTRONIC data processing - Abstract
This paper shows the development and use of a hybrid algorithm of machine learning for the missing values predict task, with this task that is carried out while the data preprocessing phase. Firstly, we go through the problem to solve, which is pointed to the study and analysis of different techniques for the missing values predict in order to suggest a hybrid technique as a product of this research for the task and link it to the OLAP (On-Line Analytical Processing) technology. Then, justifying the research methodology (scientific descriptive - probing) applied in this project. The techniques review was carried out filling out the missing values; based on the techniques proof and the study cases, k-Nearest Neighbors algorithms and artificial neural networks were selected and a hybrid technique (KMediaSom) was suggested, applied to a synthetic data set and a real one, coming from a OLAP; for the algorithms implementation was used Matlab. Right away, the analysis and results are set out in order to specify its applicability about efficacy and time complexity. Results are suitable as for the synthetic data set as for the real one; according to the test signs achieved. Finally, the conclusions, where it i's proved that the technique or hybrid algorithms generate better results than the techniques used by separately. [ABSTRACT FROM AUTHOR]
- Published
- 2016