Back to Search Start Over

Leveraging the T2T assembly to resolve rare and pathogenic inversions in reference genome gaps.

Authors :
Bilgrav Saether K
Eisfeldt J
Bengtsson JD
Lun MY
Grochowski CM
Mahmoud M
Chao HT
Rosenfeld JA
Liu P
Ek M
Schuy J
Ameur A
Dai H
Hwang JP
Sedlazeck FJ
Bi W
Marom R
Wincent J
Nordgren A
Carvalho CMB
Lindstrand A
Source :
Genome research [Genome Res] 2024 Nov 20; Vol. 34 (11), pp. 1785-1797. Date of Electronic Publication: 2024 Nov 20.
Publication Year :
2024

Abstract

Chromosomal inversions (INVs) are particularly challenging to detect due to their copy-number neutral state and association with repetitive regions. Inversions represent about 1/20 of all balanced structural chromosome aberrations and can lead to disease by gene disruption or altering regulatory regions of dosage-sensitive genes in cis Short-read genome sequencing (srGS) can only resolve ∼70% of cytogenetically visible inversions referred to clinical diagnostic laboratories, likely due to breakpoints in repetitive regions. Here, we study 12 inversions by long-read genome sequencing (lrGS) ( n = 9) or srGS ( n = 3) and resolve nine of them. In four cases, the inversion breakpoint region was missing from at least one of the human reference genomes (GRCh37, GRCh38, T2T-CHM13) and a reference agnostic analysis was needed. One of these cases, an INV9 mappable only in de novo assembled lrGS data using T2T-CHM13 disrupts EHMT1 consistent with a Mendelian diagnosis (Kleefstra syndrome 1; MIM#610253). Next, by pairwise comparison between T2T-CHM13, GRCh37, and GRCh38, as well as the chimpanzee and bonobo, we show that hundreds of megabases of sequence are missing from at least one human reference, highlighting that primate genomes contribute to genomic diversity. Aligning population genomic data to these regions indicated that these regions are variable between individuals. Our analysis emphasizes that T2T-CHM13 is necessary to maximize the value of lrGS for optimal inversion detection in clinical diagnostics. These results highlight the importance of leveraging diverse and comprehensive reference genomes to resolve unsolved molecular cases in rare diseases.<br /> (© 2024 Bilgrav Saether et al.; Published by Cold Spring Harbor Laboratory Press.)

Details

Language :
English
ISSN :
1549-5469
Volume :
34
Issue :
11
Database :
MEDLINE
Journal :
Genome research
Publication Type :
Academic Journal
Accession number :
39486878
Full Text :
https://doi.org/10.1101/gr.279346.124