Back to Search Start Over

A new minimum description length based pruning technique for rule induction algorithms

Authors :
A. A. Afify
Duc Truong Pham
Source :
Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science. 222:1339-1352
Publication Year :
2008
Publisher :
SAGE Publications, 2008.

Abstract

When learning is based on noisy data, the induced rule sets have a tendency to overfit the training data, and this degrades the performance of the resulting classifier. Consequently, the ability to tolerate noise is a necessity for robust, practical learning methods. Pruning is a common way of handling noisy data. This paper presents a new pruning technique built on the sound foundation of the minimum description length principle. The proposed pruning technique has the advantage that it does not require the set of examples employed for pruning to be distinct from the set used to build the rule set. The new technique is designed to improve the performance of the RULe Extraction System (RULES) family of inductive learning algorithms, but can be used for pruning rule sets created by other learning algorithms. It was tested in RULES-6, the latest algorithm in the family, and showed significant performance improvements.

Details

ISSN :
20412983 and 09544062
Volume :
222
Database :
OpenAIRE
Journal :
Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science
Accession number :
edsair.doi...........fa58b52f03ab41ed9a769fbb05cd3f63
Full Text :
https://doi.org/10.1243/09544062jmes842