Back to Search
Start Over
Experimental Analysis of Hyperparameters for Deep Learning-Based Churn Prediction in the Banking Sector
- Source :
- Computation, Vol 9, Iss 34, p 34 (2021), Computation, Volume 9, Issue 3
- Publication Year :
- 2021
- Publisher :
- MDPI AG, 2021.
-
Abstract
- Until recently, traditional machine learning techniques (TMLTs) such as multilayer perceptrons (MLPs) and support vector machines (SVMs) have been used successfully for churn prediction, but with significant efforts expended on the configuration of the training parameters. The selection of the right training parameters for supervised learning is almost always experimentally determined in an ad hoc manner. Deep neural networks (DNNs) have shown significant predictive strength over TMLTs when used for churn predictions. However, the more complex architecture of DNNs and their capacity to process huge amounts of non-linear input data demand more time and effort to configure the training hyperparameters for DNNs during churn modeling. This makes the process more challenging for inexperienced machine learning practitioners and researchers. So far, limited research has been done to establish the effects of different hyperparameters on the performance of DNNs during churn prediction. There is a lack of empirically derived heuristic knowledge to guide the selection of hyperparameters when DNNs are used for churn modeling. This paper presents an experimental analysis of the effects of different hyperparameters when DNNs are used for churn prediction in the banking sector. The results from three experiments revealed that the deep neural network (DNN) model performed better than the MLP when a rectifier function was used for activation in the hidden layers and a sigmoid function was used in the output layer. The performance of the DNN was better when the batch size was smaller than the size of the test set data, while the RemsProp training algorithm had better accuracy when compared with the stochastic gradient descent (SGD), Adam, AdaGrad, Adadelta, and AdaMax algorithms. The study provides heuristic knowledge that could guide researchers and practitioners in machine learning-based churn prediction from the tabular data for customer relationship management in the banking sector when DNNs are used.
- Subjects :
- General Computer Science
Computer science
churn modeling
02 engineering and technology
Machine learning
computer.software_genre
supervised learning
lcsh:QA75.5-76.95
Theoretical Computer Science
customer relationship management
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Artificial neural network
business.industry
Applied Mathematics
Deep learning
Supervised learning
Rectifier (neural networks)
Perceptron
Support vector machine
Stochastic gradient descent
machine learning
deep neural networks
Modeling and Simulation
Test set
churn prediction
020201 artificial intelligence & image processing
Artificial intelligence
lcsh:Electronic computers. Computer science
business
computer
Subjects
Details
- Language :
- English
- ISSN :
- 20793197
- Volume :
- 9
- Issue :
- 34
- Database :
- OpenAIRE
- Journal :
- Computation
- Accession number :
- edsair.doi.dedup.....62a5eb75d9310c753f465d424660ad3b