1. Using a Genetic Algorithm to optimize a stacking ensemble in data streaming scenarios
- Author
-
Paulo Novais, Diogo Ramos, Davide Carneiro, and Universidade do Minho
- Subjects
Science & Technology ,stacking ensemble ,Computer science ,Stacking ,Ciências Naturais::Ciências da Computação e da Informação ,Engenharia Eletrotécnica, Eletrónica e Informática [Engenharia e Tecnologia] ,02 engineering and technology ,Genetic algorithms ,Artificial Intelligence ,020204 information systems ,Genetic algorithm ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Ciências da Computação e da Informação [Ciências Naturais] ,optimization ,Algorithm ,random forest ,Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática - Abstract
The requirements of Machine Learning applications are changing rapidly. Machine Learning models need to deal with increasing volumes of data, and need to do so quicker as responses are expected more than ever in real-time. Plus, sources of data are becoming more and more dynamic, with patterns that change more frequently. This calls for new approaches and algorithms, that are able to efficiently deal with these challenges. In this paper we propose the use of a Genetic Algorithm to Optimize a Stacking Ensemble specifically developed for streaming scenarios. A pool of solutions is maintained in which each solution represents a distribution of weights in the ensemble. The Genetic Algorithm continuously optimizes these weights to minimize the cost function. Moreover, new models are added at regular intervals, trained on more recent data. These models eventually replace older and less accurate ones, making the ensemble adapt continuously do changes in the distribution of the data., This work was supported by the Northern Regional Operational Program, Portugal 2020 and European Union, trough the European Regional Development Fund (ERDF) and Fundos Europeus Estruturais e de Investimento (FEEI) in the scope of projects 39900 - 31/SI/2017 and NORTE-01-0145-FEDER023577, and by national funds through FCT - Fundacao para a Ciencia e a Tecnologia, through projects UIDB/04728/2020 and UID/CEC/00319/2019.
- Published
- 2020