Back to Search Start Over

A Data Mining Approach Applied to the High School National Examination: Analysis of Aspects of Candidates to Brazilian Universities

Authors :
Vilson Soares de Siqueira
Márcio Dias de Lima
Diego de Castro Rodrigues
Marcos Dias da Conceição
Rommel M. Barbosa
Source :
Progress in Artificial Intelligence ISBN: 9783030302405, EPIA (1)
Publication Year :
2019
Publisher :
Springer International Publishing, 2019.

Abstract

In many college courses in several countries are used exams in a national scale, such as Gaokao, in China, Scholastic Aptitude Test - SAT and the American College Testing - ACT in the United States of American, Yuksekogretime Gecis Sinavi – YGS in Turkey, among others. This paper examines microdata from the High School National Examination (ENEM) database from Brazil. The database has 8,627,367 records, 166 attributes, and all experiments were performed based on the Spark architecture. The objective of this work is to examine microdata of the ENEM database applying data mining algorithms and creating an approach to handle big data and to predict the profile of those enrolled in ENEM. Through the standards found by the data mining algorithms with classification algorithms, it was possible to observe that family income, access to information, profession, and academic history of the parents were directly related to the performance of the candidates. And with a rules induction algorithm, it was possible to identify the patterns presented in each of the regions of Brazil, such as common characteristics when a candidate was approved and when not, essential factors as disciplines and particular characteristics of each region. This approach also enables the execution of large volumes of data in a simplified computational structure.

Details

ISBN :
978-3-030-30240-5
ISBNs :
9783030302405
Database :
OpenAIRE
Journal :
Progress in Artificial Intelligence ISBN: 9783030302405, EPIA (1)
Accession number :
edsair.doi...........e6c3024a7638fc18117252b3c7a37fe3
Full Text :
https://doi.org/10.1007/978-3-030-30241-2_1