A Dynamic Decision-Making Method Based on Ensemble Methods for Complex Unbalanced Data
- Source :
- Web Information Systems Engineering – WISE 2019 ISBN: 9783030342227, WISE
- Publication Year :
- 2019
- Publisher :
- Springer International Publishing, 2019.
Abstract
- Class imbalance has been shown to seriously hinder the precision of many standard learning algorithms. To solve this problem, a number of methods have been proposed, for example, the distance-based balancing ensemble method, which learns from an unbalanced dataset by converting it into multiple balanced subsets on which sub-classifiers are built. However, the class-imbalance problem is usually accompanied by other data-complexity problems such as class overlap, small disjuncts, and noisy instances. Current algorithms developed for primary unbalanced-data problems cannot address these complex-data problems at the same time. Some of these algorithms even exacerbate the class-overlap and small-disjuncts problems after trying to address the complex-data problem. For this reason, this study proposes a dynamic ensemble selection decision-making (DESD) method. DESD first applies the random-splitting technique repeatedly to divide the dataset into multiple balanced subsets that contain few or no class-overlap and small-disjunct problems. Classifiers are then built on these subsets to form the candidate classifier pool. To select the most appropriate classifiers from the pool for each query instance, a weighting mechanism highlights the competence of classifiers that are better at classifying minority instances in the local region in which the query instance is located. Tests on 15 standard datasets from public repositories are performed to demonstrate the effectiveness of the DESD method. The results show that the DESD method achieves higher precision than other ensemble methods.
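The pipeline described in the abstract (random splitting into balanced subsets, a candidate classifier pool, and per-query selection weighted toward minority instances in the local region) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the nearest-centroid base learner, the Euclidean k-nearest-neighbour definition of "local region", and the parameters `k`, `top_m`, and `minority_weight` are all assumptions chosen for illustration, as are the function names and the toy data.

```python
import math
import random


def centroid(points):
    """Mean of a list of equal-length coordinate tuples."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def train_centroid_classifier(majority, minority):
    """Assumed base learner: label 1 (minority) if closer to the minority centroid."""
    c0, c1 = centroid(majority), centroid(minority)
    return lambda x: 1 if math.dist(x, c1) < math.dist(x, c0) else 0


def build_pool(majority, minority, repeats, rng):
    """Repeat random splitting: shuffle the majority class, cut it into chunks
    the size of the minority class, and train one classifier per balanced subset."""
    pool, size = [], len(minority)
    for _ in range(repeats):
        shuffled = majority[:]
        rng.shuffle(shuffled)
        for start in range(0, len(shuffled) - size + 1, size):
            chunk = shuffled[start:start + size]
            pool.append(train_centroid_classifier(chunk, minority))
    return pool


def desd_predict(query, pool, data, k=5, top_m=3, minority_weight=2.0):
    """Select the top_m classifiers by weighted accuracy on the k nearest
    training points (minority neighbours count more), then take a majority vote."""
    region = sorted(data, key=lambda xy: math.dist(query, xy[0]))[:k]

    def competence(clf):
        return sum(minority_weight if y == 1 else 1.0
                   for x, y in region if clf(x) == y)

    selected = sorted(pool, key=competence, reverse=True)[:top_m]
    votes = sum(clf(query) for clf in selected)
    return 1 if 2 * votes > len(selected) else 0


# Hypothetical toy data: 40 majority points near (0, 0), 8 minority points near (3, 3).
rng = random.Random(0)
majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(40)]
minority = [(rng.gauss(3, 0.5), rng.gauss(3, 0.5)) for _ in range(8)]
data = [(x, 0) for x in majority] + [(x, 1) for x in minority]

pool = build_pool(majority, minority, repeats=3, rng=rng)
print(len(pool), desd_predict((3.0, 3.0), pool, data), desd_predict((0.0, 0.0), pool, data))
```

Because the selection step is recomputed for every query, different classifiers may vote for different instances; that per-query choice is what makes the ensemble selection dynamic.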
- Subjects :
- 0209 industrial biotechnology
Ensemble selection
Computer science
02 engineering and technology
A-weighting
Machine learning
Ensemble learning
Class imbalance
020901 industrial engineering & automation
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Artificial intelligence
Unbalanced data
Classifier (UML)
Dynamic decision-making
Details
- ISBN :
- 978-3-030-34222-7
- ISBNs :
- 9783030342227
- Database :
- OpenAIRE
- Journal :
- Web Information Systems Engineering – WISE 2019 ISBN: 9783030342227, WISE
- Accession number :
- edsair.doi...........636f8f2ad6b011e8dd223f4624ef6d3b
- Full Text :
- https://doi.org/10.1007/978-3-030-34223-4_23