Back to Search
Start Over
PRSice-2: Polygenic Risk Score software for biobank-scale data
- Source :
- GigaScience
- Publication Year :
- 2019
- Publisher :
- Oxford University Press (OUP), 2019.
-
Abstract
- Background Polygenic risk score (PRS) analyses have become an integral part of biomedical research, exploited to gain insights into shared aetiology among traits, to control for genomic profile in experimental studies, and to strengthen causal inference, among a range of applications. Substantial efforts are now devoted to biobank projects to collect large genetic and phenotypic data, providing unprecedented opportunity for genetic discovery and applications. To process the large-scale data provided by such biobank resources, highly efficient and scalable methods and software are required. Results Here we introduce PRSice-2, an efficient and scalable software program for automating and simplifying PRS analyses on large-scale data. PRSice-2 handles both genotyped and imputed data, provides empirical association P-values free from inflation due to overfitting, supports different inheritance models, and can evaluate multiple continuous and binary target traits simultaneously. We demonstrate that PRSice-2 is dramatically faster and more memory-efficient than PRSice-1 and alternative PRS software, LDpred and lassosum, while having comparable predictive power. Conclusion PRSice-2's combination of efficiency and power will be increasingly important as data sizes grow and as the applications of PRS become more sophisticated, e.g., when incorporated into high-dimensional or gene set–based analyses. PRSice-2 is written in C++, with an R script for plotting, and is freely available for download from http://PRSice.info.
- Subjects :
- Big Data
Multifactorial Inheritance
Computer science
Quantitative Trait Loci
imputation
Health Informatics
Overfitting
Machine learning
computer.software_genre
03 medical and health sciences
0302 clinical medicine
Software
Technical Note
Animals
Humans
GWAS
030304 developmental biology
0303 health sciences
business.industry
Biobank
Computer Science Applications
Causal inference
polygenic risk score
Genomic Profile
Scalability
Predictive power
Artificial intelligence
business
computer
030217 neurology & neurosurgery
Imputation (genetics)
Genome-Wide Association Study
Subjects
Details
- ISSN :
- 2047217X
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- GigaScience
- Accession number :
- edsair.doi.dedup.....f3d0f9cec4e942ea7e735dfa82a8ae8b
- Full Text :
- https://doi.org/10.1093/gigascience/giz082