Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data.
- Source :
- BMC bioinformatics [BMC Bioinformatics] 2003 Dec 02; Vol. 4, pp. 60. Date of Electronic Publication: 2003 Dec 02.
- Publication Year :
- 2003
- Abstract :
- Background: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the prediction of marker genes is accomplished by either manual feature selection (visualizing the weighted/mean SOM component plane) or automatic feature selection (by pair-wise Fisher's linear discriminant).
Results: The proposed models were tested on four published datasets: (1) Leukemia, (2) Colon cancer, (3) Brain tumors, and (4) NCI cancer cell lines. The models gave class prediction with markedly reduced error rates compared to other class prediction approaches, and the importance of feature selection on microarray data analysis was also emphasized.
Conclusions: Our models identify marker genes with predictive potential, often better than other available methods in the literature. The models are potentially useful for medical diagnostics and may reveal some insights into cancer classification. Additionally, we illustrated two limitations in tumor classification from microarray data related to the biology underlying the data, in terms of (1) the class size of data, and (2) the internal structure of classes. These limitations are not specific to the classification models used.
- Subjects :
- Brain Neoplasms classification
Brain Neoplasms genetics
Cell Line, Tumor
Chromosome Mapping methods
Chromosome Mapping statistics & numerical data
Cluster Analysis
Colonic Neoplasms classification
Colonic Neoplasms genetics
Computational Biology statistics & numerical data
Female
Gene Expression Regulation, Neoplastic genetics
Humans
Leukemia classification
Leukemia genetics
Ovarian Neoplasms classification
Ovarian Neoplasms genetics
Ovarian Neoplasms pathology
Predictive Value of Tests
Biomarkers, Tumor genetics
Fuzzy Logic
Gene Expression Profiling statistics & numerical data
Genes, Neoplasm genetics
Neoplasms classification
Neoplasms genetics
Oligonucleotide Array Sequence Analysis statistics & numerical data
- Language :
- English
- ISSN :
- 1471-2105
- Volume :
- 4
- Database :
- MEDLINE
- Journal :
- BMC bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 14651757
- Full Text :
- https://doi.org/10.1186/1471-2105-4-60
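- Illustration :
- The abstract names Fuzzy C-means clustering as the sample classification step. The block below is a minimal sketch of the standard fuzzy c-means iteration only, not the authors' implementation: the function name `fuzzy_c_means`, the fuzzifier `m=2.0`, the stopping tolerance, and the random initialization are all assumptions made for illustration. It clusters raw vectors directly, whereas the record describes clustering SOM-summarized expression profiles; the SOM step and the Fisher's linear discriminant feature selection are not shown.

```python
import math
import random


def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Return (centers, memberships) for a list of equal-length numeric vectors.

    Hypothetical helper for illustration; parameter defaults are assumptions.
    """
    rng = random.Random(seed)
    dim = len(points[0])
    n = len(points)

    # Random initial memberships, each row normalized to sum to 1.
    U = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        U.append([u / total for u in row])

    centers = []
    for _ in range(n_iter):
        # Each center is the mean of all points weighted by membership**m.
        centers = []
        for k in range(n_clusters):
            w = [U[i][k] ** m for i in range(n)]
            wsum = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / wsum
                 for d in range(dim)]
            )

        # Each membership is proportional to distance**(-2 / (m - 1)).
        shift = 0.0
        for i, x in enumerate(points):
            inv = [max(math.dist(x, c), 1e-12) ** (-2.0 / (m - 1.0))
                   for c in centers]
            total = sum(inv)
            new_row = [v / total for v in inv]
            shift = max(shift, max(abs(a - b) for a, b in zip(new_row, U[i])))
            U[i] = new_row

        # Stop once no membership changed by more than the tolerance.
        if shift < tol:
            break

    return centers, U
```

Each row of the returned membership matrix sums to 1, so a sample can belong partly to several tumor classes; taking the largest entry per row gives a hard class assignment when one is needed.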