1. Long-read proteogenomics to connect disease-associated sQTLs to the protein isoform effectors of disease.
- Author
-
Abood A, Mesner LD, Jeffery ED, Murali M, Lehe MD, Saquing J, Farber CR, and Sheynkman GM
- Subjects
- Humans, Osteoblasts metabolism, Polymorphism, Single Nucleotide, Quantitative Trait Loci, Genome-Wide Association Study, Protein Isoforms genetics, Bone Density genetics, Alternative Splicing genetics, Proteogenomics methods
- Abstract
A major fraction of loci identified by genome-wide association studies (GWASs) mediate alternative splicing, but mechanistic interpretation is hindered by the technical limitations of short-read RNA sequencing (RNA-seq), which cannot directly link splicing events to full-length protein isoforms. Long-read RNA-seq represents a powerful tool to characterize transcript isoforms, and recently, infer protein isoform existence. Here, we present an approach that integrates information from GWASs, splicing quantitative trait loci (sQTLs), and PacBio long-read RNA-seq in a disease-relevant model to infer the effects of sQTLs on the ultimate protein isoform products they encode. We demonstrate the utility of our approach using bone mineral density (BMD) GWAS data. We identified 1,863 sQTLs from the Genotype-Tissue Expression (GTEx) project in 732 protein-coding genes that colocalized with BMD associations (H4PP ≥ 0.75). We generated PacBio Iso-Seq data (N = ∼22 million full-length reads) on human osteoblasts, identifying 68,326 protein-coding isoforms, of which 17,375 (25%) were unannotated. By casting the sQTLs onto protein isoforms, we connected 809 sQTLs to 2,029 protein isoforms from 441 genes expressed in osteoblasts. Overall, we found that 74 sQTLs influenced isoforms likely impacted by nonsense-mediated decay and 190 that potentially resulted in the expression of unannotated protein isoforms. Finally, we functionally validated colocalizing sQTLs in TPM2, in which siRNA-mediated knockdown in osteoblasts showed two TPM2 isoforms with opposing effects on mineralization but exhibited no effect upon knockdown of the entire gene. Our approach should be to generalize across diverse clinical traits and to provide insights into protein isoform activities modulated by GWAS loci., Competing Interests: Declaration of interests The authors declare that they have no conflicts of interest with the contents of this article., (Copyright © 2024 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF