
Use of Random forest in the identification of important variables.

Authors :
Lovatti, Betina P.O.
Nascimento, Márcia H.C.
Neto, Álvaro C.
Castro, Eustáquio V.R.
Filgueiras, Paulo R.
Source :
Microchemical Journal. Mar2019, Vol. 145, p1129-1134. 6p.
Publication Year :
2019

Abstract

Random Forest (RF) has shown promise for supervised classification across different matrices. However, approaches for identifying the variables that carry the most weight in the model remain scarce in classification problems. In this paper, we propose a methodology for selecting the most relevant variables in the construction of RF models. To apply this methodology, classification models were developed to discriminate crude oil samples according to their maximum pour point (MPP). To this end, MPP data (ASTM D5853) were acquired for 105 crude oil samples, along with their hydrogen (1H) and carbon (13C) NMR spectra. With MPP ranging from −54 °C to 39 °C, two classes were assigned: the first containing 43 samples with MPP ≤ −9 °C, and the second containing 62 samples with MPP > −9 °C. The 1H NMR model, with 90% accuracy, and the 13C NMR model, with 71% accuracy, were used in the variable selection method. The results showed that the proposed methodology was effective in distinguishing the variables that contributed most to the discrimination of the oils. This new tool therefore enabled a greater understanding of the chemical information of interest contained in the spectra and its relationship with the MPP of the crude oil samples.

Highlights
• RF was applied to discriminate petroleum samples in relation to the maximum pour point.
• 1H NMR and 13C NMR are efficient in discriminating samples in relation to pour point.
• The most important variables of the RF model were identified.
• The maximum pour point depends on the equilibrium between saturated, aromatic and polar contents.
[ABSTRACT FROM AUTHOR]
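The two-class assignment and the accuracy figures mentioned in the abstract can be illustrated with a minimal Python sketch. Only the −9 °C boundary comes from the abstract; the function names `assign_class` and `accuracy` and the sample MPP values are hypothetical, chosen for illustration, and this is not the authors' code or their RF model.

```python
# Class boundary stated in the abstract: MPP <= -9 °C vs. MPP > -9 °C.
MPP_THRESHOLD_C = -9.0


def assign_class(mpp_celsius: float) -> int:
    """Return 1 for MPP <= -9 °C and 2 for MPP > -9 °C (hypothetical helper)."""
    return 1 if mpp_celsius <= MPP_THRESHOLD_C else 2


def accuracy(true_labels, predicted_labels) -> float:
    """Fraction of samples whose predicted class matches the true class."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")
    hits = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return hits / len(true_labels)


# Illustrative MPP values in °C spanning the reported range; not data from the paper.
example_mpp = [-54.0, -9.0, -8.9, 0.0, 39.0]
example_labels = [assign_class(v) for v in example_mpp]
```

Labels produced this way would serve as the targets for a classifier trained on the spectra, and `accuracy` corresponds to the kind of figure the abstract reports for the 1H and 13C NMR models.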

Details

Language :
English
ISSN :
0026265X
Volume :
145
Database :
Academic Search Index
Journal :
Microchemical Journal
Publication Type :
Academic Journal
Accession number :
134187681
Full Text :
https://doi.org/10.1016/j.microc.2018.12.028