Back to Search
Start Over
A new minimum description length based pruning technique for rule induction algorithms
- Source :
- Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science. 222:1339-1352
- Publication Year :
- 2008
- Publisher :
- SAGE Publications, 2008.
-
Abstract
- When learning is based on noisy data, the induced rule sets have a tendency to overfit the training data, and this degrades the performance of the resulting classifier. Consequently, the ability to tolerate noise is a necessity for robust, practical learning methods. Pruning is a common way of handling noisy data. This paper presents a new pruning technique built on the sound foundation of the minimum description length principle. The proposed pruning technique has the advantage that it does not require the set of examples employed for pruning to be distinct from the set used to build the rule set. The new technique is designed to improve the performance of the RULe Extraction System (RULES) family of inductive learning algorithms, but can be used for pruning rule sets created by other learning algorithms. It was tested in RULES-6, the latest algorithm in the family, and showed significant performance improvements.
- Subjects :
- business.industry
Rule induction
Computer science
Mechanical Engineering
Overfitting
computer.software_genre
Machine learning
Knowledge extraction
Principal variation search
Null-move heuristic
Artificial intelligence
Data mining
Minimum description length
business
Classifier (UML)
computer
Algorithm
Killer heuristic
Subjects
Details
- ISSN :
- 20412983 and 09544062
- Volume :
- 222
- Database :
- OpenAIRE
- Journal :
- Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science
- Accession number :
- edsair.doi...........fa58b52f03ab41ed9a769fbb05cd3f63
- Full Text :
- https://doi.org/10.1243/09544062jmes842