Back to Search
Start Over
Comparison of Different Sampling Algorithms for Phenotype Prediction
- Source :
- Bioinformatics and Biomedical Engineering ISBN: 9783319787589, IWBBIO (2)
- Publication Year :
- 2018
- Publisher :
- Springer International Publishing, 2018.
-
Abstract
- In this paper, we compare different sampling algorithms used for identifying the defective pathways in highly underdetermined phenotype prediction problems. The first algorithm (Fisher’s ratio sampler) selects the most discriminatory genes and samples the high discriminatory genetic networks according to a prior probability that it is proportional to their individual Fisher’s ratio. The second one (holdout sampler) is inspired by the bootstrapping procedure used in regression analysis and uses the minimum-scale signatures found in different random hold outs to establish the most frequently sampled genes. The third one is a pure random sampler which randomly builds networks of differentially expressed genes. In all these algorithms, the likelihood of the different networks is established via leave one out cross-validation (LOOCV), and the posterior analysis of the most frequently sampled genes serves to establish the altered biological pathways. These algorithms are compared to the results obtained via Bayesian Networks (BNs). We show the application of these algorithms to a microarray dataset concerning Triple Negative Breast Cancers. This comparison shows that the Random, Fisher’s ratio and Holdout samplers are most effective than BNs, and all provide similar insights about the genetic mechanisms that are involved in this disease. Therefore, it can be concluded that all these samplers are good alternatives to Bayesian Networks which much lower computational demands. Besides this analysis confirms the insight that the altered pathways should be independent of the sampling methodology and the classifier that is used to infer them.
- Subjects :
- 0301 basic medicine
Underdetermined system
business.industry
Computer science
Bayesian network
Regression analysis
Pattern recognition
Phenotype
03 medical and health sciences
symbols.namesake
030104 developmental biology
0302 clinical medicine
030220 oncology & carcinogenesis
Prior probability
symbols
Artificial intelligence
business
Classifier (UML)
Triple negative
Gibbs sampling
Subjects
Details
- ISBN :
- 978-3-319-78758-9
- ISBNs :
- 9783319787589
- Database :
- OpenAIRE
- Journal :
- Bioinformatics and Biomedical Engineering ISBN: 9783319787589, IWBBIO (2)
- Accession number :
- edsair.doi...........50aa43879be1e554abbe0884ee4349e0