Back to Search
Start Over
Wavelet Screening: a novel approach to analyzing GWAS data
- Source :
- BMC Bioinformatics, Vol 22, Iss 1, Pp 1-20 (2021), BMC Bioinformatics
- Publication Year :
- 2021
- Publisher :
- BMC, 2021.
-
Abstract
- Background Traditional methods for single-variant genome-wide association study (GWAS) incur a substantial multiple-testing burden because of the need to test for associations with a vast number of single-nucleotide polymorphisms (SNPs) simultaneously. Further, by ignoring more complex joint effects of nearby SNPs within a given region, these methods fail to consider the genomic context of an association with the outcome. Results To address these shortcomings, we present a more powerful method for GWAS, coined ‘Wavelet Screening’ (WS), that greatly reduces the number of tests to be performed. This is achieved through the use of a sliding-window approach based on wavelets to sequentially screen the entire genome for associations. Wavelets are oscillatory functions that are useful for analyzing the local frequency and time behavior of signals. The signals can then be divided into different scale components and analyzed separately. In the current setting, we consider a sequence of SNPs as a genetic signal, and for each screened region, we transform the genetic signal into the wavelet space. The null and alternative hypotheses are modeled using the posterior distribution of the wavelet coefficients. WS is enhanced by using additional information from the regression coefficients and by taking advantage of the pyramidal structure of wavelets. When faced with more complex genetic signals than single-SNP associations, we show via simulations that WS provides a substantial gain in power compared to both the traditional GWAS modeling and another popular regional association test called SNP-set (Sequence) Kernel Association Test (SKAT). To demonstrate feasibility, we applied WS to a large Norwegian cohort (N=8006) with genotypes and information available on gestational duration. Conclusions WS is a powerful and versatile approach to analyzing whole-genome data and lends itself easily to investigating various omics data types. Given its broader focus on the genomic context of an association, WS may provide additional insight into trait etiology by revealing genes and loci that might have been missed by previous efforts.
- Subjects :
- Wavelet regression
Genotype
Computer science
QH301-705.5
Association (object-oriented programming)
Posterior probability
Computer applications to medicine. Medical informatics
R858-859.7
SNP
Context (language use)
Genome-wide association study
computer.software_genre
Polymorphism, Single Nucleotide
Biochemistry
Wavelet
Structural Biology
Humans
GWAS
Multiple testing
Biology (General)
Molecular Biology
Methodology Article
Applied Mathematics
Genomics
Computer Science Applications
Phenotype
Null (SQL)
Kernel (statistics)
Multiple comparisons problem
Polygenic association
Data mining
computer
Genome-Wide Association Study
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Volume :
- 22
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....710d102d7cc3953d3529d567b3b0d28e