Back to Search Start Over

Systematic analysis of paralogous regions in 41,755 exomes uncovers clinically relevant variation

Authors :
Wouter Steyaert
Lonneke Haer-Wigman
Rolph Pfundt
Debby Hellebrekers
Marloes Steehouwer
Juliet Hampstead
Elke de Boer
Alexander Stegmann
Helger Yntema
Erik-Jan Kamsteeg
Han Brunner
Alexander Hoischen
Christian Gilissen
Source :
Nature Communications, Vol 14, Iss 1, Pp 1-13 (2023)
Publication Year :
2023
Publisher :
Nature Portfolio, 2023.

Abstract

Abstract The short lengths of short-read sequencing reads challenge the analysis of paralogous genomic regions in exome and genome sequencing data. Most genetic variants within these homologous regions therefore remain unidentified in standard analyses. Here, we present a method (Chameleolyser) that accurately identifies single nucleotide variants and small insertions/deletions (SNVs/Indels), copy number variants and ectopic gene conversion events in duplicated genomic regions using whole-exome sequencing data. Application to a cohort of 41,755 exome samples yields 20,432 rare homozygous deletions and 2,529,791 rare SNVs/Indels, of which we show that 338,084 are due to gene conversion events. None of the SNVs/Indels are detectable using regular analysis techniques. Validation by high-fidelity long-read sequencing in 20 samples confirms >88% of called variants. Focusing on variation in known disease genes leads to a direct molecular diagnosis in 25 previously undiagnosed patients. Our method can readily be applied to existing exome data.

Subjects

Subjects :
Science

Details

Language :
English
ISSN :
20411723
Volume :
14
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Nature Communications
Publication Type :
Academic Journal
Accession number :
edsdoj.b6807c8412374d2aa25e893290d2b4c6
Document Type :
article
Full Text :
https://doi.org/10.1038/s41467-023-42531-9