Back to Search
Start Over
Variable Prioritization in Nonlinear Black Box Methods: A Genetic Association Case Study
- Source :
- Crawford, L; Flaxman, SR; Runcie, DE; & West, M. (2018). Variable Prioritization in Nonlinear Black Box Methods: A Genetic Association Case Study. UC Davis: Retrieved from: http://www.escholarship.org/uc/item/9083t4bq, Ann. Appl. Stat. 13, no. 2 (2019), 958-989, Ann Appl Stat
- Publication Year :
- 2018
- Publisher :
- eScholarship, University of California, 2018.
-
Abstract
- The central aim in this paper is to address variable selection questions in nonlinear and nonparametric regression. Motivated by statistical genetics, where nonlinear interactions are of particular interest, we introduce a novel and interpretable way to summarize the relative importance of predictor variables. Methodologically, we develop the "RelATive cEntrality" (RATE) measure to prioritize candidate genetic variants that are not just marginally important, but whose associations also stem from significant covarying relationships with other variants in the data. We illustrate RATE through Bayesian Gaussian process regression, but the methodological innovations apply to other "black box" methods. It is known that nonlinear models often exhibit greater predictive accuracy than linear models, particularly for phenotypes generated by complex genetic architectures. With detailed simulations and two real data association mapping studies, we show that applying RATE enables an explanation for this improved performance.<br />28 pages, 5 figures, 1 tables; Supplementary Material
- Subjects :
- 0301 basic medicine
SELECTION
FOS: Computer and information sciences
Computer science
Gaussian processes
computer.software_genre
01 natural sciences
Quantitative Biology - Quantitative Methods
010104 statistics & probability
REGRESSION-MODELS
Statistics - Machine Learning
Black box
variable prioritization
POPULATION
Quantitative Methods (q-bio.QM)
education.field_of_study
QUANTITATIVE TRAIT LOCI
0104 Statistics
Linear model
Regression analysis
stat.ML
BODY-WEIGHT
stat.ME
Modeling and Simulation
Physical Sciences
statistical genetics
Statistics, Probability and Uncertainty
Statistics and Probability
Statistics & Probability
Population
Bayesian probability
Feature selection
Machine Learning (stat.ML)
Machine learning
Statistics - Applications
Generalized linear mixed model
Article
LINEAR MIXED MODELS
Methodology (stat.ME)
03 medical and health sciences
1403 Econometrics
Nonlinear regression
EPISTASIS
Applications (stat.AP)
0101 mathematics
GENOME-WIDE ASSOCIATION
education
stat.AP
Statistics - Methodology
Science & Technology
q-bio.QM
business.industry
COMPLEX TRAITS
centrality measures
Nonparametric regression
030104 developmental biology
FOS: Biological sciences
genome-wide association studies
STATISTICAL-METHODS
Artificial intelligence
business
computer
Mathematics
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Crawford, L; Flaxman, SR; Runcie, DE; & West, M. (2018). Variable Prioritization in Nonlinear Black Box Methods: A Genetic Association Case Study. UC Davis: Retrieved from: http://www.escholarship.org/uc/item/9083t4bq, Ann. Appl. Stat. 13, no. 2 (2019), 958-989, Ann Appl Stat
- Accession number :
- edsair.doi.dedup.....a8f223dce3988428807cba54cd9deb62