Back to Search Start Over

Rapid and accurate multi-phenotype imputation for millions of individuals

Authors :
Lin-Lin Gu
Hong-Shan Wu
Tian-Yi Liu
Yong-Jie Zhang
Jing-Cheng He
Xiao-Lei Liu
Zhi-Yong Wang
Guo-Bo Chen
Dan Jiang
Ming Fang
Source :
Nature Communications, Vol 16, Iss 1, Pp 1-13 (2025)
Publication Year :
2025
Publisher :
Nature Portfolio, 2025.

Abstract

Abstract Deep phenotyping can enhance the power of genetic analysis, including genome-wide association studies (GWAS), but the occurrence of missing phenotypes compromises the potential of such resources. Although many phenotypic imputation methods have been developed, the accurate imputation of millions of individuals remains challenging. In the present study, we have developed a multi-phenotype imputation method based on mixed fast random forest (PIXANT) by leveraging efficient machine learning (ML)-based algorithms. We demonstrate by extensive simulations that PIXANT is reliable, robust and highly resource-efficient. We then apply PIXANT to the UKB data of 277,301 unrelated White British citizens and 425 traits, and GWAS is subsequently performed on the imputed phenotypes, 18.4% more GWAS loci are identified than before imputation (8710 vs 7355). The increased statistical power of GWAS identified some additional candidate genes affecting heart rate, such as RNF220, SCN10A, and RGS6, suggesting that the use of imputed phenotype data from a large cohort may lead to the discovery of additional candidate genes for complex traits.

Subjects

Subjects :
Science

Details

Language :
English
ISSN :
20411723
Volume :
16
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Nature Communications
Publication Type :
Academic Journal
Accession number :
edsdoj.7e4608ca484eaeace47e2575b94ad6
Document Type :
article
Full Text :
https://doi.org/10.1038/s41467-024-55496-0