Back to Search Start Over

Leveraging functional annotations in genetic risk prediction for human complex diseases

Authors :
Ryan L Powles
Xinran Xu
Fang Fang
Can Yang
Xinwei Yao
Yiming Hu
Qiongshi Lu
Hongyu Zhao
Source :
PLoS Computational Biology, Vol 13, Iss 6, p e1005589 (2017), PLoS Computational Biology
Publication Year :
2017
Publisher :
Public Library of Science (PLoS), 2017.

Abstract

Genetic risk prediction is an important goal in human genetics research and precision medicine. Accurate prediction models will have great impacts on both disease prevention and early treatment strategies. Despite the identification of thousands of disease-associated genetic variants through genome wide association studies (GWAS), genetic risk prediction accuracy remains moderate for most diseases, which is largely due to the challenges in both identifying all the functionally relevant variants and accurately estimating their effect sizes in the presence of linkage disequilibrium. In this paper, we introduce AnnoPred, a principled framework that leverages diverse types of genomic and epigenomic functional annotations in genetic risk prediction for complex diseases. AnnoPred is trained using GWAS summary statistics in a Bayesian framework in which we explicitly model various functional annotations and allow for linkage disequilibrium estimated from reference genotype data. Compared with state-of-the-art risk prediction methods, AnnoPred achieves consistently improved prediction accuracy in both extensive simulations and real data.<br />Author summary Genetic risk prediction plays a significant role in precision medicine. Accurate prediction models could have great impact on disease prevention and early treatment strategies. For example, mutations in BRCA1 and BRCA2 have been used to evaluate women’s breast cancer risk and as a guideline for early screening. However, genetic risk prediction models also present important challenges, including extreme high-dimensionality, limited access to and efficient computational methods for individual-level genotype data. To make use of rich GWAS summary statistics, we propose a novel method to address these challenges by integrating genomic functional annotations, which have been successfully applied in GWAS to generate biological insights. We demonstrate the improvement in accuracy in both extensive simulation studies and real data analysis of breast cancer, Crohn’s disease, celiac disease, rheumatoid arthritis and type-II diabetes.

Subjects

Subjects :
Epigenomics
0301 basic medicine
Linkage disequilibrium
Computer science
Genome-wide association study
computer.software_genre
Genome
Linkage Disequilibrium
0302 clinical medicine
Mathematical and Statistical Techniques
Endocrinology
Databases, Genetic
Breast Tumors
Medicine and Health Sciences
Data Mining
Genetic risk
lcsh:QH301-705.5
0303 health sciences
Ecology
Chromosome Mapping
Genomics
Genome project
Functional Genomics
3. Good health
Functional annotation
Oncology
Computational Theory and Mathematics
030220 oncology & carcinogenesis
Data Interpretation, Statistical
Modeling and Simulation
Physical Sciences
Treatment strategy
Data mining
Risk assessment
Statistics (Mathematics)
Research Article
Endocrine Disorders
Quantitative Trait Loci
Immunology
Rheumatoid Arthritis
Single-nucleotide polymorphism
Computational biology
Biology
Research and Analysis Methods
Polymorphism, Single Nucleotide
Risk Assessment
Autoimmune Diseases
03 medical and health sciences
Cellular and Molecular Neuroscience
Rheumatology
Breast Cancer
Genetic variation
Genome-Wide Association Studies
Diabetes Mellitus
Genetics
Humans
Genetic Predisposition to Disease
Statistical Methods
Molecular Biology
Genetic Association Studies
Ecology, Evolution, Behavior and Systematics
030304 developmental biology
Proportional Hazards Models
Genome, Human
Arthritis
Genetic Variation
Biology and Life Sciences
Computational Biology
Cancers and Neoplasms
Human Genetics
Genome Analysis
Precision medicine
Genome Annotation
Human genetics
030104 developmental biology
lcsh:Biology (General)
Metabolic Disorders
Genetics of Disease
Disease prevention
Clinical Immunology
Clinical Medicine
computer
Predictive modelling
Mathematics
Forecasting

Details

ISSN :
15537358
Volume :
13
Database :
OpenAIRE
Journal :
PLOS Computational Biology
Accession number :
edsair.doi.dedup.....d0f0146f7a00e72286e665c4c3c93d96