Classification of high dimensional biomedical data based on feature selection using redundant removal.

Authors :: Zhang, Bingtao
Cao, Peng
Source :: PLoS ONE; 4/9/2019, Vol. 14 Issue 4, p1-19, 19p
Publication Year :: 2019
Abstract: High dimensional biomedical data contain tens of thousands of features; accurate and effective identification of the core features in these data can assist in diagnosing related diseases. However, biomedical data often contain a large number of irrelevant or redundant features, which seriously degrade subsequent classification accuracy and machine learning efficiency. To solve this problem, this paper proposes a novel filter feature selection algorithm based on redundant removal (FSBRR) for classifying high dimensional biomedical data. First, two redundancy criteria are determined by vertical relevance (the relationship between a feature and the class attribute) and horizontal relevance (the relationship between one feature and another). Second, to quantify the redundancy criteria, an approximate redundancy feature framework based on mutual information (MI) is defined to remove redundant and irrelevant features. To evaluate the effectiveness of the proposed algorithm, controlled trials based on typical feature selection algorithms are conducted using three different classifiers, and the experimental results indicate that the FSBRR algorithm can effectively reduce the feature dimension and improve classification accuracy. In addition, an experiment on a small sample dataset is designed and conducted in the discussion and analysis section to clarify the specific implementation process of the FSBRR algorithm. [ABSTRACT FROM AUTHOR]
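
The abstract does not state the exact redundancy criteria, so the following is only a minimal sketch of the general idea it describes: score each feature against the class with mutual information (vertical relevance), then discard features that share too much information with an already kept feature (horizontal relevance). It is not the FSBRR algorithm itself. The function names `mutual_information` and `select_features`, the `min_relevance` threshold, and the specific redundancy rule are assumptions made for illustration, and features are assumed to be already discretized.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # Sum over observed (x, y) pairs of p(x, y) * log2(p(x, y) / (p(x) * p(y))).
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def select_features(features, labels, min_relevance=0.1):
    """Greedy relevance/redundancy filter (illustrative only, not FSBRR).

    features: dict mapping feature name -> list of discrete values
    labels:   list of class labels, same length as each feature column
    """
    # Vertical relevance: MI between each feature and the class attribute.
    relevance = {
        name: mutual_information(column, labels)
        for name, column in features.items()
    }
    # Drop features below the (assumed) relevance threshold, then rank the
    # remaining ones from most to least relevant.
    ranked = sorted(
        (name for name in features if relevance[name] >= min_relevance),
        key=lambda name: relevance[name],
        reverse=True,
    )
    selected = []
    for name in ranked:
        # Horizontal relevance: MI between the candidate and each kept feature.
        # Assumed rule: the candidate is redundant if a kept feature shares at
        # least as much information with it as the candidate shares with the class.
        redundant = any(
            mutual_information(features[name], features[kept]) >= relevance[name]
            for kept in selected
        )
        if not redundant:
            selected.append(name)
    return selected
```

For example, given a feature that matches the class labels, an inverted copy of that feature, and a feature unrelated to the class, this sketch keeps only the first: the copy is rejected as redundant and the unrelated feature as irrelevant.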

Subjects :: FEATURE selection
DATABASES
CLASSIFICATION
MACHINE learning
PHYSICAL sciences
LIFE sciences

Details

Language :: English
ISSN :: 19326203
Volume :: 14
Issue :: 4
Database :: Complementary Index
Journal :: PLoS ONE
Publication Type :: Academic Journal
Accession number :: 135808130
Full Text :: https://doi.org/10.1371/journal.pone.0214406
