27 results on '"Amariuta T"'
Search Results
2. Fine-mapping causal tissues and genes at disease-associated loci.
- Author
-
Strober BJ, Zhang MJ, Amariuta T, Rossen J, and Price AL
- Subjects
- Humans, Chromosome Mapping, Polymorphism, Single Nucleotide, Genome-Wide Association Study, Organ Specificity genetics, Genotype, Leukocytes, Mononuclear metabolism, Quantitative Trait Loci, Genetic Predisposition to Disease
- Abstract
Complex diseases often have distinct mechanisms spanning multiple tissues. We propose tissue-gene fine-mapping (TGFM), which infers the posterior inclusion probability (PIP) for each gene-tissue pair to mediate a disease locus by analyzing summary statistics and expression quantitative trait loci (eQTL) data; TGFM also assigns PIPs to non-mediated variants. TGFM accounts for co-regulation across genes and tissues and models uncertainty in cis-predicted expression models, enabling correct calibration. We applied TGFM to 45 UK Biobank diseases or traits using eQTL data from 38 Genotype-Tissue Expression (GTEx) tissues. TGFM identified an average of 147 PIP > 0.5 causal genetic elements per disease or trait, of which 11% were gene-tissue pairs. Causal gene-tissue pairs identified by TGFM reflected both known biology (for example, TPO-thyroid for hypothyroidism) and biologically plausible findings (for example, SLC20A2-artery aorta for diastolic blood pressure). Application of TGFM to single-cell eQTL data from nine cell types in peripheral blood mononuclear cells (PBMCs), analyzed jointly with GTEx tissues, identified 30 additional causal gene-PBMC cell type pairs., Competing Interests: Competing interests: The authors declare no competing interests., (© 2025. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2025
- Full Text
- View/download PDF
3. Paired analysis of host and pathogen genomes identifies determinants of human tuberculosis.
- Author
-
Luo Y, Huang CC, Howard NC, Wang X, Liu Q, Li X, Zhu J, Amariuta T, Asgari S, Ishigaki K, Calderon R, Raman S, Ramnarine AK, Mayfield JA, Moody DB, Lecca L, Fortune SM, Murray MB, and Raychaudhuri S
- Subjects
- Humans, Peru epidemiology, Genome, Bacterial, COVID-19 genetics, COVID-19 virology, Macrophages microbiology, Female, Polymorphism, Single Nucleotide, Male, Cohort Studies, Mutation, Adult, Thioredoxin-Disulfide Reductase genetics, Thioredoxin-Disulfide Reductase metabolism, SARS-CoV-2 genetics, Mycobacterium tuberculosis genetics, Mycobacterium tuberculosis pathogenicity, Tuberculosis microbiology, Tuberculosis genetics, Host-Pathogen Interactions genetics
- Abstract
Infectious disease is the result of interactions between host and pathogen and can depend on genetic variations in both. We conduct a genome-to-genome study of paired human and Mycobacterium tuberculosis genomes from a cohort of 1556 tuberculosis patients in Lima, Peru. We identify an association between a human intronic variant (rs3130660, OR = 10.06, 95%CI: 4.87 - 20.77, P = 7.92 × 10
-8 ) in the FLOT1 gene and a subclavaluee of Mtb Lineage 2. In a human macrophage infection model, we observe hosts with the rs3130660-A allele exhibited stronger interferon gene signatures. The interacting strains have altered redox states due to a thioredoxin reductase mutation. We investigate this association in a 2020 cohort of 699 patients recruited during the COVID-19 pandemic. While the prevalence of the interacting strain almost doubled between 2010 and 2020, its infection is not associated with rs3130660 in this recent cohort. These findings suggest a complex interplay among host, pathogen, and environmental factors in tuberculosis dynamics., Competing Interests: Competing interests: The authors declare no competing interests., (© 2024. The Author(s).)- Published
- 2024
- Full Text
- View/download PDF
4. Powerful mapping of cis -genetic effects on gene expression across diverse populations reveals novel disease-critical genes.
- Author
-
Akamatsu K, Golzari S, and Amariuta T
- Abstract
While disease-associated variants identified by genome-wide association studies (GWAS) most likely regulate gene expression levels, linking variants to target genes is critical to determining the functional mechanisms of these variants. Genetic effects on gene expression have been extensively characterized by expression quantitative trait loci (eQTL) studies, yet data from non-European populations is limited. This restricts our understanding of disease to genes whose regulatory variants are common in European populations. While previous work has leveraged data from multiple populations to improve GWAS power and polygenic risk score (PRS) accuracy, multi-ancestry data has not yet been used to better estimate cis -genetic effects on gene expression. Here, we present a new method, Multi-Ancestry Gene Expression Prediction Regularized Optimization (MAGEPRO), which constructs robust genetic models of gene expression in understudied populations or cell types by fitting a regularized linear combination of eQTL summary data across diverse cohorts. In simulations, our tool generates more accurate models of gene expression than widely-used LASSO and the state-of-the-art multi-ancestry PRS method, PRS-CSx, adapted to gene expression prediction. We attribute this improvement to MAGEPRO's ability to more accurately estimate causal eQTL effect sizes ( p < 3.98 × 10
-4 , two-sided paired t-test). With real data, we applied MAGEPRO to 8 eQTL cohorts representing 3 ancestries (average n = 355) and consistently outperformed each of 6 competing methods in gene expression prediction tasks. Integration with GWAS summary statistics across 66 complex traits (representing 22 phenotypes and 3 ancestries) resulted in 2,331 new gene-trait associations, many of which replicate across multiple ancestries, including PHTF1 linked to white blood cell count, a gene which is overexpressed in leukemia patients. MAGEPRO also identified biologically plausible novel findings, such as PIGB, an essential component of GPI biosynthesis, associated with heart failure, which has been previously evidenced by clinical outcome data. Overall, MAGEPRO is a powerful tool to enhance inference of gene regulatory effects in underpowered datasets and has improved our understanding of population-specific and shared genetic effects on complex traits.- Published
- 2024
- Full Text
- View/download PDF
5. Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy.
- Author
-
DeForest N, Wang Y, Zhu Z, Dron JS, Koesterer R, Natarajan P, Flannick J, Amariuta T, Peloso GM, and Majithia AR
- Subjects
- Humans, Female, Male, Quantitative Trait Loci, Middle Aged, Adult, Polymorphism, Single Nucleotide, Genomics methods, Insulin Resistance genetics, Genome-Wide Association Study, Triglycerides blood, Cholesterol, HDL blood, Cholesterol, HDL genetics
- Abstract
Insulin resistance causes multiple epidemic metabolic diseases, including type 2 diabetes, cardiovascular disease, and fatty liver, but is not routinely measured in epidemiological studies. To discover novel insulin resistance genes in the general population, we conducted genome-wide association studies in 382,129 individuals for triglyceride to HDL-cholesterol ratio (TG/HDL), a surrogate marker of insulin resistance calculable from commonly measured serum lipid profiles. We identified 251 independent loci, of which 62 were more strongly associated with TG/HDL compared to TG or HDL alone, suggesting them as insulin resistance loci. Candidate causal genes at these loci were prioritized by fine mapping with directions-of-effect and tissue specificity annotated through analysis of protein coding and expression quantitative trait variation. Directions-of-effect were corroborated in an independent cohort of individuals with directly measured insulin resistance. We highlight two phospholipase encoding genes, PLA2G12A and PLA2G6, which liberate arachidonic acid and improve insulin sensitivity, and VGLL3, a transcriptional co-factor that increases insulin resistance partially through enhanced adiposity. Finally, we implicate the anti-apoptotic gene TNFAIP8 as a sex-dimorphic insulin resistance factor, which acts by increasing visceral adiposity, specifically in females. In summary, our study identifies several candidate modulators of insulin resistance that have the potential to serve as biomarkers and pharmacological targets., (© 2024. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.)
- Published
- 2024
- Full Text
- View/download PDF
6. Fine-mapping causal tissues and genes at disease-associated loci.
- Author
-
Strober BJ, Zhang MJ, Amariuta T, Rossen J, and Price AL
- Abstract
Heritable diseases often manifest in a highly tissue-specific manner, with different disease loci mediated by genes in distinct tissues or cell types. We propose Tissue-Gene Fine-Mapping (TGFM), a fine-mapping method that infers the posterior probability (PIP) for each gene-tissue pair to mediate a disease locus by analyzing GWAS summary statistics (and in-sample LD) and leveraging eQTL data from diverse tissues to build cis-predicted expression models; TGFM also assigns PIPs to causal variants that are not mediated by gene expression in assayed genes and tissues. TGFM accounts for both co-regulation across genes and tissues and LD between SNPs (generalizing existing fine-mapping methods), and incorporates genome-wide estimates of each tissue's contribution to disease as tissue-level priors. TGFM was well-calibrated and moderately well-powered in simulations; unlike previous methods, TGFM was able to attain correct calibration by modeling uncertainty in cis-predicted expression models. We applied TGFM to 45 UK Biobank diseases/traits (average N = 316K) using eQTL data from 38 GTEx tissues. TGFM identified an average of 147 PIP > 0.5 causal genetic elements per disease/trait, of which 11% were gene-tissue pairs. Implicated gene-tissue pairs were concentrated in known disease-critical tissues, and causal genes were strongly enriched in disease-relevant gene sets. Causal gene-tissue pairs identified by TGFM recapitulated known biology (e.g., TPO -thyroid for Hypothyroidism), but also included biologically plausible novel findings (e.g., SLC20A2 -artery aorta for Diastolic blood pressure). Further application of TGFM to single-cell eQTL data from 9 cell types in peripheral blood mononuclear cells (PBMC), analyzed jointly with GTEx tissues, identified 30 additional causal gene-PBMC cell type pairs at PIP > 0.5-primarily for autoimmune disease and blood cell traits, including the biologically plausible example of CD52 in classical monocyte cells for Monocyte count. In conclusion, TGFM is a robust and powerful method for fine-mapping causal tissues and genes at disease-associated loci.
- Published
- 2024
- Full Text
- View/download PDF
7. GWAS for systemic sclerosis identifies six novel susceptibility loci including one in the Fcγ receptor region.
- Author
-
Ishikawa Y, Tanaka N, Asano Y, Kodera M, Shirai Y, Akahoshi M, Hasegawa M, Matsushita T, Saito K, Motegi SI, Yoshifuji H, Yoshizaki A, Kohmoto T, Takagi K, Oka A, Kanda M, Tanaka Y, Ito Y, Nakano K, Kasamatsu H, Utsunomiya A, Sekiguchi A, Niiro H, Jinnin M, Makino K, Makino T, Ihn H, Yamamoto M, Suzuki C, Takahashi H, Nishida E, Morita A, Yamamoto T, Fujimoto M, Kondo Y, Goto D, Sumida T, Ayuzawa N, Yanagida H, Horita T, Atsumi T, Endo H, Shima Y, Kumanogoh A, Hirata J, Otomo N, Suetsugu H, Koike Y, Tomizuka K, Yoshino S, Liu X, Ito S, Hikino K, Suzuki A, Momozawa Y, Ikegawa S, Tanaka Y, Ishikawa O, Takehara K, Torii T, Sato S, Okada Y, Mimori T, Matsuda F, Matsuda K, Amariuta T, Imoto I, Matsuo K, Kuwana M, Kawaguchi Y, Ohmura K, and Terao C
- Subjects
- Humans, Genome-Wide Association Study, Receptors, IgG genetics, Genetic Risk Score, Polymorphism, Single Nucleotide, Interferon Regulatory Factors genetics, Genetic Loci, Genetic Predisposition to Disease genetics, Scleroderma, Systemic genetics
- Abstract
Here we report the largest Asian genome-wide association study (GWAS) for systemic sclerosis performed to date, based on data from Japanese subjects and comprising of 1428 cases and 112,599 controls. The lead SNP is in the FCGR/FCRL region, which shows a penetrating association in the Asian population, while a complete linkage disequilibrium SNP, rs10917688, is found in a cis-regulatory element for IRF8. IRF8 is also a significant locus in European GWAS for systemic sclerosis, but rs10917688 only shows an association in the presence of the risk allele of IRF8 in the Japanese population. Further analysis shows that rs10917688 is marked with H3K4me1 in primary B cells. A meta-analysis with a European GWAS detects 30 additional significant loci. Polygenic risk scores constructed with the effect sizes of the meta-analysis suggest the potential portability of genetic associations beyond populations. Prioritizing the top 5% of SNPs of IRF8 binding sites in B cells improves the fitting of the polygenic risk scores, underscoring the roles of B cells and IRF8 in the development of systemic sclerosis. The results also suggest that systemic sclerosis shares a common genetic architecture across populations., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
8. High-dimensional phenotyping to define the genetic basis of cellular morphology.
- Author
-
Tegtmeyer M, Arora J, Asgari S, Cimini BA, Nadig A, Peirent E, Liyanage D, Way GP, Weisbart E, Nathan A, Amariuta T, Eggan K, Haghighi M, McCarroll SA, O'Connor L, Carpenter AE, Singh S, Nehme R, and Raychaudhuri S
- Subjects
- Chromosome Mapping, Cell Nucleus, Cell Shape, Mutant Proteins, Quantitative Trait Loci genetics, Induced Pluripotent Stem Cells
- Abstract
The morphology of cells is dynamic and mediated by genetic and environmental factors. Characterizing how genetic variation impacts cell morphology can provide an important link between disease association and cellular function. Here, we combine genomic sequencing and high-content imaging approaches on iPSCs from 297 unique donors to investigate the relationship between genetic variants and cellular morphology to map what we term cell morphological quantitative trait loci (cmQTLs). We identify novel associations between rare protein altering variants in WASF2, TSPAN15, and PRLR with several morphological traits related to cell shape, nucleic granularity, and mitochondrial distribution. Knockdown of these genes by CRISPRi confirms their role in cell morphology. Analysis of common variants yields one significant association and nominate over 300 variants with suggestive evidence (P < 10
-6 ) of association with one or more morphology traits. We then use these data to make predictions about sample size requirements for increasing discovery in cellular genetic studies. We conclude that, similar to molecular phenotypes, morphological profiling can yield insight about the function of genes and variants., (© 2024. The Author(s).)- Published
- 2024
- Full Text
- View/download PDF
9. Dynamic regulatory elements in single-cell multimodal data implicate key immune cell states enriched for autoimmune disease heritability.
- Author
-
Gupta A, Weinand K, Nathan A, Sakaue S, Zhang MJ, Donlin L, Wei K, Price AL, Amariuta T, and Raychaudhuri S
- Subjects
- Humans, Regulatory Sequences, Nucleic Acid genetics, Chromosomes, Genome, Human, Chromatin genetics, Autoimmune Diseases genetics
- Abstract
In autoimmune diseases such as rheumatoid arthritis, the immune system attacks the body's own cells. Developing a precise understanding of the cell states where noncoding autoimmune risk variants impart causal mechanisms is critical to developing curative therapies. Here, to identify noncoding regions with accessible chromatin that associate with cell-state-defining gene expression patterns, we leveraged multimodal single-nucleus RNA and assay for transposase-accessible chromatin (ATAC) sequencing data across 28,674 cells from the inflamed synovial tissue of 12 donors. Specifically, we used a multivariate Poisson model to predict peak accessibility from single-nucleus RNA sequencing principal components. For 14 autoimmune diseases, we discovered that cell-state-dependent ('dynamic') chromatin accessibility peaks in immune cell types were enriched for heritability, compared with cell-state-invariant ('cs-invariant') peaks. These dynamic peaks marked regulatory elements associated with T peripheral helper, regulatory T, dendritic and STAT1
+ CXCL10+ myeloid cell states. We argue that dynamic regulatory elements can help identify precise cell states enriched for disease-critical genetic variation., (© 2023. The Author(s), under exclusive licence to Springer Nature America, Inc.)- Published
- 2023
- Full Text
- View/download PDF
10. The power paradox of detecting disease-associated and gene-expression-associated variants.
- Author
-
Amariuta T
- Subjects
- Humans, Gene Expression, Genetic Variation, Genetic Predisposition to Disease
- Published
- 2023
- Full Text
- View/download PDF
11. Modeling tissue co-regulation estimates tissue-specific contributions to disease.
- Author
-
Amariuta T, Siewert-Rocks K, and Price AL
- Subjects
- Humans, Quantitative Trait Loci, Polymorphism, Single Nucleotide, Transcriptome genetics, Genetic Predisposition to Disease, Genome-Wide Association Study methods
- Abstract
Integrative analyses of genome-wide association studies and gene expression data have implicated many disease-critical tissues. However, co-regulation of genetic effects on gene expression across tissues impedes distinguishing biologically causal tissues from tagging tissues. In the present study, we introduce tissue co-regulation score regression (TCSC), which disentangles causal tissues from tagging tissues by regressing gene-disease association statistics (from transcriptome-wide association studies) on tissue co-regulation scores, reflecting correlations of predicted gene expression across genes and tissues. We applied TCSC to 78 diseases/traits (average n = 302,000) and gene expression prediction models for 48 GTEx tissues. TCSC identified 21 causal tissue-trait pairs at a 5% false discovery rate (FDR), including well-established findings, biologically plausible new findings (for example, aorta artery and glaucoma) and increased specificity of known tissue-trait associations (for example, subcutaneous adipose, but not visceral adipose, and high-density lipoprotein). TCSC also identified 17 causal tissue-trait covariance pairs at 5% FDR. In conclusion, TCSC is a precise method for distinguishing causal tissues from tagging tissues., (© 2023. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2023
- Full Text
- View/download PDF
12. Genetics of skeletal proportions in two different populations.
- Author
-
Bartell E, Lin K, Tsuo K, Gan W, Vedantam S, Cole JB, Baronas JM, Yengo L, Marouli E, Amariuta T, Chen Z, Li L, Renthal NE, Jacobsen CM, Salem RM, Walters RG, and Hirschhorn JN
- Abstract
Human height can be divided into sitting height and leg length, reflecting growth of different parts of the skeleton whose relative proportions are captured by the ratio of sitting to total height (as sitting height ratio, SHR). Height is a highly heritable trait, and its genetic basis has been well-studied. However, the genetic determinants of skeletal proportion are much less well-characterized. Expanding substantially on past work, we performed a genome-wide association study (GWAS) of SHR in ∼450,000 individuals with European ancestry and ∼100,000 individuals with East Asian ancestry from the UK and China Kadoorie Biobanks. We identified 565 loci independently associated with SHR, including all genomic regions implicated in prior GWAS in these ancestries. While SHR loci largely overlap height-associated loci (P < 0.001), the fine-mapped SHR signals were often distinct from height. We additionally used fine-mapped signals to identify 36 credible sets with heterogeneous effects across ancestries. Lastly, we used SHR, sitting height, and leg length to identify genetic variation acting on specific body regions rather than on overall human height.
- Published
- 2023
- Full Text
- View/download PDF
13. Multi-ancestry genome-wide association analyses identify novel genetic mechanisms in rheumatoid arthritis.
- Author
-
Ishigaki K, Sakaue S, Terao C, Luo Y, Sonehara K, Yamaguchi K, Amariuta T, Too CL, Laufer VA, Scott IC, Viatte S, Takahashi M, Ohmura K, Murasawa A, Hashimoto M, Ito H, Hammoudeh M, Emadi SA, Masri BK, Halabi H, Badsha H, Uthman IW, Wu X, Lin L, Li T, Plant D, Barton A, Orozco G, Verstappen SMM, Bowes J, MacGregor AJ, Honda S, Koido M, Tomizuka K, Kamatani Y, Tanaka H, Tanaka E, Suzuki A, Maeda Y, Yamamoto K, Miyawaki S, Xie G, Zhang J, Amos CI, Keystone E, Wolbink G, van der Horst-Bruinsma I, Cui J, Liao KP, Carroll RJ, Lee HS, Bang SY, Siminovitch KA, de Vries N, Alfredsson L, Rantapää-Dahlqvist S, Karlson EW, Bae SC, Kimberly RP, Edberg JC, Mariette X, Huizinga T, Dieudé P, Schneider M, Kerick M, Denny JC, Matsuda K, Matsuo K, Mimori T, Matsuda F, Fujio K, Tanaka Y, Kumanogoh A, Traylor M, Lewis CM, Eyre S, Xu H, Saxena R, Arayssi T, Kochi Y, Ikari K, Harigai M, Gregersen PK, Yamamoto K, Louis Bridges S Jr, Padyukov L, Martin J, Klareskog L, Okada Y, and Raychaudhuri S
- Subjects
- Humans, Genetic Predisposition to Disease, Polymorphism, Single Nucleotide genetics, Asian People genetics, Adaptor Proteins, Signal Transducing genetics, Genome-Wide Association Study, Arthritis, Rheumatoid genetics
- Abstract
Rheumatoid arthritis (RA) is a highly heritable complex disease with unknown etiology. Multi-ancestry genetic research of RA promises to improve power to detect genetic signals, fine-mapping resolution and performances of polygenic risk scores (PRS). Here, we present a large-scale genome-wide association study (GWAS) of RA, which includes 276,020 samples from five ancestral groups. We conducted a multi-ancestry meta-analysis and identified 124 loci (P < 5 × 10
-8 ), of which 34 are novel. Candidate genes at the novel loci suggest essential roles of the immune system (for example, TNIP2 and TNFRSF11A) and joint tissues (for example, WISP1) in RA etiology. Multi-ancestry fine-mapping identified putatively causal variants with biological insights (for example, LEF1). Moreover, PRS based on multi-ancestry GWAS outperformed PRS based on single-ancestry GWAS and had comparable performance between populations of European and East Asian ancestries. Our study provides several insights into the etiology of RA and improves the genetic predictability of RA., (© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.)- Published
- 2022
- Full Text
- View/download PDF
14. Single-cell eQTL models reveal dynamic T cell state dependence of disease loci.
- Author
-
Nathan A, Asgari S, Ishigaki K, Valencia C, Amariuta T, Luo Y, Beynor JI, Baglaenko Y, Suliman S, Price AL, Lecca L, Murray MB, Moody DB, and Raychaudhuri S
- Subjects
- Gene Expression Regulation, Humans, Peru, Genetic Predisposition to Disease genetics, Memory T Cells immunology, Memory T Cells metabolism, Quantitative Trait Loci genetics
- Abstract
Non-coding genetic variants may cause disease by modulating gene expression. However, identifying these expression quantitative trait loci (eQTLs) is complicated by differences in gene regulation across fluid functional cell states within cell types. These states-for example, neurotransmitter-driven programs in astrocytes or perivascular fibroblast differentiation-are obscured in eQTL studies that aggregate cells
1,2 . Here we modelled eQTLs at single-cell resolution in one complex cell type: memory T cells. Using more than 500,000 unstimulated memory T cells from 259 Peruvian individuals, we show that around one-third of 6,511 cis-eQTLs had effects that were mediated by continuous multimodally defined cell states, such as cytotoxicity and regulatory capacity. In some loci, independent eQTL variants had opposing cell-state relationships. Autoimmune variants were enriched in cell-state-dependent eQTLs, including risk variants for rheumatoid arthritis near ORMDL3 and CTLA4; this indicates that cell-state context is crucial to understanding potential eQTL pathogenicity. Moreover, continuous cell states explained more variation in eQTLs than did conventional discrete categories, such as CD4+ versus CD8+ , suggesting that modelling eQTLs and cell states at single-cell resolution can expand insight into gene regulation in functionally heterogeneous cell types., (© 2022. The Author(s), under exclusive licence to Springer Nature Limited.)- Published
- 2022
- Full Text
- View/download PDF
15. Population-specific causal disease effect sizes in functionally important regions impacted by selection.
- Author
-
Shi H, Gazal S, Kanai M, Koch EM, Schoech AP, Siewert KM, Kim SS, Luo Y, Amariuta T, Huang H, Okada Y, Raychaudhuri S, Sunyaev SR, and Price AL
- Subjects
- Algorithms, Asian People genetics, Genomics methods, Haplotypes genetics, Humans, Models, Genetic, White People genetics, Genetics, Population methods, Genome-Wide Association Study methods, Linkage Disequilibrium, Polymorphism, Single Nucleotide, Selection, Genetic
- Abstract
Many diseases exhibit population-specific causal effect sizes with trans-ethnic genetic correlations significantly less than 1, limiting trans-ethnic polygenic risk prediction. We develop a new method, S-LDXR, for stratifying squared trans-ethnic genetic correlation across genomic annotations, and apply S-LDXR to genome-wide summary statistics for 31 diseases and complex traits in East Asians (average N = 90K) and Europeans (average N = 267K) with an average trans-ethnic genetic correlation of 0.85. We determine that squared trans-ethnic genetic correlation is 0.82× (s.e. 0.01) depleted in the top quintile of background selection statistic, implying more population-specific causal effect sizes. Accordingly, causal effect sizes are more population-specific in functionally important regions, including conserved and regulatory regions. In regions surrounding specifically expressed genes, causal effect sizes are most population-specific for skin and immune genes, and least population-specific for brain genes. Our results could potentially be explained by stronger gene-environment interaction at loci impacted by selection, particularly positive selection.
- Published
- 2021
- Full Text
- View/download PDF
16. Improving the trans-ancestry portability of polygenic risk scores by prioritizing variants in predicted cell-type-specific regulatory elements.
- Author
-
Amariuta T, Ishigaki K, Sugishita H, Ohta T, Koido M, Dey KK, Matsuda K, Murakami Y, Price AL, Kawakami E, Terao C, and Raychaudhuri S
- Subjects
- Base Sequence, Computational Biology methods, Gene Expression Regulation genetics, Genome-Wide Association Study, Humans, Models, Genetic, Molecular Sequence Annotation, Multifactorial Inheritance genetics, Asian People genetics, Enhancer Elements, Genetic genetics, Genetic Predisposition to Disease genetics, Polymorphism, Single Nucleotide genetics, White People genetics
- Abstract
Poor trans-ancestry portability of polygenic risk scores is a consequence of Eurocentric genetic studies and limited knowledge of shared causal variants. Leveraging regulatory annotations may improve portability by prioritizing functional over tagging variants. We constructed a resource of 707 cell-type-specific IMPACT regulatory annotations by aggregating 5,345 epigenetic datasets to predict binding patterns of 142 transcription factors across 245 cell types. We then partitioned the common SNP heritability of 111 genome-wide association study summary statistics of European (average n ≈ 189,000) and East Asian (average n ≈ 157,000) origin. IMPACT annotations captured consistent SNP heritability between populations, suggesting prioritization of shared functional variants. Variant prioritization using IMPACT resulted in increased trans-ancestry portability of polygenic risk scores from Europeans to East Asians across all 21 phenotypes analyzed (49.9% mean relative increase in R
2 ). Our study identifies a crucial role for functional annotations such as IMPACT to improve the trans-ancestry portability of genetic data.- Published
- 2020
- Full Text
- View/download PDF
17. Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases.
- Author
-
Ishigaki K, Akiyama M, Kanai M, Takahashi A, Kawakami E, Sugishita H, Sakaue S, Matoba N, Low SK, Okada Y, Terao C, Amariuta T, Gazal S, Kochi Y, Horikoshi M, Suzuki K, Ito K, Koyama S, Ozaki K, Niida S, Sakata Y, Sakata Y, Kohno T, Shiraishi K, Momozawa Y, Hirata M, Matsuda K, Ikeda M, Iwata N, Ikegawa S, Kou I, Tanaka T, Nakagawa H, Suzuki A, Hirota T, Tamari M, Chayama K, Miki D, Mori M, Nagayama S, Daigo Y, Miki Y, Katagiri T, Ogawa O, Obara W, Ito H, Yoshida T, Imoto I, Takahashi T, Tanikawa C, Suzuki T, Sinozaki N, Minami S, Yamaguchi H, Asai S, Takahashi Y, Yamaji K, Takahashi K, Fujioka T, Takata R, Yanai H, Masumoto A, Koretsune Y, Kutsumi H, Higashiyama M, Murayama S, Minegishi N, Suzuki K, Tanno K, Shimizu A, Yamaji T, Iwasaki M, Sawada N, Uemura H, Tanaka K, Naito M, Sasaki M, Wakai K, Tsugane S, Yamamoto M, Yamamoto K, Murakami Y, Nakamura Y, Raychaudhuri S, Inazawa J, Yamauchi T, Kadowaki T, Kubo M, and Kamatani Y
- Subjects
- Cohort Studies, Female, Genetic Variation, Humans, Inheritance Patterns, Japan, Male, Sex Factors, Transcription Factors genetics, Genetic Predisposition to Disease ethnology, Genome-Wide Association Study
- Abstract
The overwhelming majority of participants in current genetic studies are of European ancestry. To elucidate disease biology in the East Asian population, we conducted a genome-wide association study (GWAS) with 212,453 Japanese individuals across 42 diseases. We detected 320 independent signals in 276 loci for 27 diseases, with 25 novel loci (P < 9.58 × 10
-9 ). East Asian-specific missense variants were identified as candidate causal variants for three novel loci, and we successfully replicated two of them by analyzing independent Japanese cohorts; p.R220W of ATG16L2 (associated with coronary artery disease) and p.V326A of POT1 (associated with lung cancer). We further investigated enrichment of heritability within 2,868 annotations of genome-wide transcription factor occupancy, and identified 378 significant enrichments across nine diseases (false discovery rate < 0.05) (for example, NKX3-1 for prostate cancer). This large-scale GWAS in a Japanese population provides insights into the etiology of complex diseases and highlights the importance of performing GWAS in non-European populations.- Published
- 2020
- Full Text
- View/download PDF
18. Annotations capturing cell type-specific TF binding explain a large fraction of disease heritability.
- Author
-
van de Geijn B, Finucane H, Gazal S, Hormozdiari F, Amariuta T, Liu X, Gusev A, Loh PR, Reshef Y, Kichaev G, Raychauduri S, and Price AL
- Subjects
- Binding Sites genetics, Computational Biology, Gene Expression Regulation genetics, Genetic Diseases, Inborn classification, Genetic Diseases, Inborn pathology, Humans, Linkage Disequilibrium genetics, Multifactorial Inheritance genetics, Polymorphism, Single Nucleotide genetics, Protein Binding genetics, Chromatin genetics, Genetic Diseases, Inborn genetics, Molecular Sequence Annotation, Transcription Factors genetics
- Abstract
Regulatory variation plays a major role in complex disease and that cell type-specific binding of transcription factors (TF) is critical to gene regulation. However, assessing the contribution of genetic variation in TF-binding sites to disease heritability is challenging, as binding is often cell type-specific and annotations from directly measured TF binding are not currently available for most cell type-TF pairs. We investigate approaches to annotate TF binding, including directly measured chromatin data and sequence-based predictions. We find that TF-binding annotations constructed by intersecting sequence-based TF-binding predictions with cell type-specific chromatin data explain a large fraction of heritability across a broad set of diseases and corresponding cell types; this strategy of constructing annotations addresses both the limitation that identical sequences may be bound or unbound depending on surrounding chromatin context and the limitation that sequence-based predictions are generally not cell type-specific. We partitioned the heritability of 49 diseases and complex traits using stratified linkage disequilibrium (LD) score regression with the baseline-LD model (which is not cell type-specific) plus the new annotations. We determined that 100 bp windows around MotifMap sequenced-based TF-binding predictions intersected with a union of six cell type-specific chromatin marks (imputed using ChromImpute) performed best, with an 58% increase in heritability enrichment compared to the chromatin marks alone (11.6× vs. 7.3×, P = 9 × 10-14 for difference) and a 20% increase in cell type-specific signal conditional on annotations from the baseline-LD model (P = 8 × 10-11 for difference). Our results show that TF-binding annotations explain substantial disease heritability and can help refine genome-wide association signals., (© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.)
- Published
- 2020
- Full Text
- View/download PDF
19. Advances in genetics toward identifying pathogenic cell states of rheumatoid arthritis.
- Author
-
Amariuta T, Luo Y, Knevel R, Okada Y, and Raychaudhuri S
- Subjects
- Animals, Genetic Association Studies, Genetic Predisposition to Disease, Humans, Immunologic Memory, Risk, Arthritis, Rheumatoid genetics, B-Lymphocytes physiology, CD4-Positive T-Lymphocytes physiology, Fibroblasts physiology, Monocytes physiology
- Abstract
Rheumatoid arthritis (RA) risk has a large genetic component (~60%) that is still not fully understood. This has hampered the design of effective treatments that could promise lifelong remission. RA is a polygenic disease with 106 known genome-wide significant associated loci and thousands of small effect causal variants. Our current understanding of RA risk has suggested cell-type-specific contexts for causal variants, implicating CD4 + effector memory T cells, as well as monocytes, B cells and stromal fibroblasts. While these cellular states and categories are still mechanistically broad, future studies may identify causal cell subpopulations. These efforts are propelled by advances in single cell profiling. Identification of causal cell subpopulations may accelerate therapeutic intervention to achieve lifelong remission., (© 2019 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.)
- Published
- 2020
- Full Text
- View/download PDF
20. Allele-specific expression changes dynamically during T cell activation in HLA and other autoimmune loci.
- Author
-
Gutierrez-Arcelus M, Baglaenko Y, Arora J, Hannes S, Luo Y, Amariuta T, Teslovich N, Rao DA, Ermann J, Jonsson AH, Navarrete C, Rich SS, Taylor KD, Rotter JI, Gregersen PK, Esko T, Brenner MB, and Raychaudhuri S
- Subjects
- Alleles, CD4-Positive T-Lymphocytes, CRISPR-Cas Systems, Cell Line, Gene Expression Regulation, Genetic Loci, Genotyping Techniques, HLA Antigens metabolism, HLA-DQ beta-Chains metabolism, Humans, Immunity, Cellular, T-Lymphocytes, Regulatory, Autoimmunity genetics, Genetic Variation, HLA Antigens genetics, HLA-DQ beta-Chains genetics, Lymphocyte Activation genetics, Promoter Regions, Genetic genetics
- Abstract
Genetic studies have revealed that autoimmune susceptibility variants are over-represented in memory CD4
+ T cell regulatory elements1-3 . Understanding how genetic variation affects gene expression in different T cell physiological states is essential for deciphering genetic mechanisms of autoimmunity4,5 . Here, we characterized the dynamics of genetic regulatory effects at eight time points during memory CD4+ T cell activation with high-depth RNA-seq in healthy individuals. We discovered widespread, dynamic allele-specific expression across the genome, where the balance of alleles changes over time. These genes were enriched fourfold within autoimmune loci. We found pervasive dynamic regulatory effects within six HLA genes. HLA-DQB1 alleles had one of three distinct transcriptional regulatory programs. Using CRISPR-Cas9 genomic editing we demonstrated that a promoter variant is causal for T cell-specific control of HLA-DQB1 expression. Our study shows that genetic variation in cis-regulatory elements affects gene expression in a manner dependent on lymphocyte activation status, contributing to the interindividual complexity of immune responses.- Published
- 2020
- Full Text
- View/download PDF
21. Genes with High Network Connectivity Are Enriched for Disease Heritability.
- Author
-
Kim SS, Dai C, Hormozdiari F, van de Geijn B, Gazal S, Park Y, O'Connor L, Amariuta T, Loh PR, Finucane H, Raychaudhuri S, and Price AL
- Published
- 2019
- Full Text
- View/download PDF
22. Early progression to active tuberculosis is a highly heritable trait driven by 3q23 in Peruvians.
- Author
-
Luo Y, Suliman S, Asgari S, Amariuta T, Baglaenko Y, Martínez-Bonet M, Ishigaki K, Gutierrez-Arcelus M, Calderon R, Lecca L, León SR, Jimenez J, Yataco R, Contreras C, Galea JT, Becerra M, Nejentsev S, Nigrovic PA, Moody DB, Murray MB, and Raychaudhuri S
- Subjects
- Adult, Female, Gene Expression, Genetic Loci, Genome-Wide Association Study, Genotype, Humans, Male, Monocytes, Mycobacterium tuberculosis genetics, Peru, Sodium-Potassium-Exchanging ATPase genetics, Disease Progression, Mycobacterium tuberculosis pathogenicity, Tuberculosis genetics
- Abstract
Of the 1.8 billion people worldwide infected with Mycobacterium tuberculosis, 5-15% will develop active tuberculosis (TB). Approximately half will progress to active TB within the first 18 months after infection, presumably because they fail to mount an effective initial immune response. Here, in a genome-wide genetic study of early TB progression, we genotype 4002 active TB cases and their household contacts in Peru. We quantify genetic heritability ([Formula: see text]) of early TB progression to be 21.2% (standard error 0.08). This suggests TB progression has a strong genetic basis, and is comparable to traits with well-established genetic bases. We identify a novel association between early TB progression and variants located in a putative enhancer region on chromosome 3q23 (rs73226617, OR = 1.18; P = 3.93 × 10
-8 ). With in silico and in vitro analyses we identify rs73226617 or rs148722713 as the likely functional variant and ATP1B3 as a potential causal target gene with monocyte specific function.- Published
- 2019
- Full Text
- View/download PDF
23. Benchmarker: An Unbiased, Association-Data-Driven Strategy to Evaluate Gene Prioritization Algorithms.
- Author
-
Fine RS, Pers TH, Amariuta T, Raychaudhuri S, and Hirschhorn JN
- Subjects
- Benchmarking, Chromosome Mapping, Humans, Phenotype, Algorithms, Genetic Loci, Genome-Wide Association Study methods, Linkage Disequilibrium, Polymorphism, Single Nucleotide
- Abstract
Genome-wide association studies (GWASs) are valuable for understanding human biology, but associated loci typically contain multiple associated variants and genes. Thus, algorithms that prioritize likely causal genes and variants for a given phenotype can provide biological interpretations of association data. However, a critical, currently missing capability is to objectively compare performance of such algorithms. Typical comparisons rely on "gold standard" genes harboring causal coding variants, but such gold standards may be biased and incomplete. To address this issue, we developed Benchmarker, an unbiased, data-driven benchmarking method that compares performance of similarity-based prioritization strategies to each other (and to random chance) by leave-one-chromosome-out cross-validation with stratified linkage disequilibrium (LD) score regression. We first applied Benchmarker to 20 well-powered GWASs and compared gene prioritization based on strategies employing three different data sources, including annotated gene sets and gene expression; genes prioritized based on gene sets had higher per-SNP heritability than those prioritized based on gene expression. Additionally, in a direct comparison of three methods, DEPICT and MAGMA outperformed NetWAS. We also evaluated combinations of methods; our results indicated that combining data sources and algorithms can help prioritize higher-quality genes for follow-up. Benchmarker provides an unbiased approach to evaluate any similarity-based method that provides genome-wide prioritization of genes, variants, or gene sets and can determine the best such method for any particular GWAS. Our method addresses an important unmet need for rigorous tool assessment and can assist in mapping genetic associations to causal function., (Copyright © 2019 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2019
- Full Text
- View/download PDF
24. IMPACT: Genomic Annotation of Cell-State-Specific Regulatory Elements Inferred from the Epigenome of Bound Transcription Factors.
- Author
-
Amariuta T, Luo Y, Gazal S, Davenport EE, van de Geijn B, Ishigaki K, Westra HJ, Teslovich N, Okada Y, Yamamoto K, Price AL, and Raychaudhuri S
- Subjects
- Arthritis, Rheumatoid genetics, Chromatin genetics, Chromatin metabolism, Computational Biology methods, Histones genetics, Histones metabolism, Humans, Promoter Regions, Genetic, Protein Binding, Transcription Factors genetics, Arthritis, Rheumatoid metabolism, Epigenome, Epigenomics methods, Genome, Human, Molecular Sequence Annotation, Regulatory Sequences, Nucleic Acid, Transcription Factors metabolism
- Abstract
Despite significant progress in annotating the genome with experimental methods, much of the regulatory noncoding genome remains poorly defined. Here we assert that regulatory elements may be characterized by leveraging local epigenomic signatures where specific transcription factors (TFs) are bound. To link these two features, we introduce IMPACT, a genome annotation strategy that identifies regulatory elements defined by cell-state-specific TF binding profiles, learned from 515 chromatin and sequence annotations. We validate IMPACT using multiple compelling applications. First, IMPACT distinguishes between bound and unbound TF motif sites with high accuracy (average AUPRC 0.81, SE 0.07; across 8 tested TFs) and outperforms state-of-the-art TF binding prediction methods, MocapG, MocapS, and Virtual ChIP-seq. Second, in eight tested cell types, RNA polymerase II IMPACT annotations capture more cis-eQTL variation than sequence-based annotations, such as promoters and TSS windows (25% average increase in enrichment). Third, integration with rheumatoid arthritis (RA) summary statistics from European (N = 38,242) and East Asian (N = 22,515) populations revealed that the top 5% of CD4
+ Treg IMPACT regulatory elements capture 85.7% of RA h2, the most comprehensive explanation for RA h2 to date. In comparison, the average RA h2 captured by compared CD4+ T histone marks is 42.3% and by CD4+ T specifically expressed gene sets is 36.4%. Lastly, we find that IMPACT may be used in many different cell types to identify complex trait associated regulatory elements., (Copyright © 2019 The Author(s). Published by Elsevier Inc. All rights reserved.)- Published
- 2019
- Full Text
- View/download PDF
25. Discovering in vivo cytokine-eQTL interactions from a lupus clinical trial.
- Author
-
Davenport EE, Amariuta T, Gutierrez-Arcelus M, Slowikowski K, Westra HJ, Luo Y, Shen C, Rao DA, Zhang Y, Pearson S, von Schack D, Beebe JS, Bing N, John S, Vincent MS, Zhang B, and Raychaudhuri S
- Subjects
- Humans, Cytokines genetics, Gene Expression Regulation, Lupus Erythematosus, Systemic genetics, Quantitative Trait Loci genetics
- Abstract
Background: Cytokines are critical to human disease and are attractive therapeutic targets given their widespread influence on gene regulation and transcription. Defining the downstream regulatory mechanisms influenced by cytokines is central to defining drug and disease mechanisms. One promising strategy is to use interactions between expression quantitative trait loci (eQTLs) and cytokine levels to define target genes and mechanisms., Results: In a clinical trial for anti-IL-6 in patients with systemic lupus erythematosus, we measure interferon (IFN) status, anti-IL-6 drug exposure, and whole blood genome-wide gene expression at three time points. We show that repeat transcriptomic measurements increases the number of cis eQTLs identified compared to using a single time point. We observe a statistically significant enrichment of in vivo eQTL interactions with IFN status and anti-IL-6 drug exposure and find many novel interactions that have not been previously described. Finally, we find transcription factor binding motifs interrupted by eQTL interaction SNPs, which point to key regulatory mediators of these environmental stimuli and therefore potential therapeutic targets for autoimmune diseases. In particular, genes with IFN interactions are enriched for ISRE binding site motifs, while those with anti-IL-6 interactions are enriched for IRF4 motifs., Conclusions: This study highlights the potential to exploit clinical trial data to discover in vivo eQTL interactions with therapeutically relevant environmental variables.
- Published
- 2018
- Full Text
- View/download PDF
26. Systems proteomics of liver mitochondria function.
- Author
-
Williams EG, Wu Y, Jha P, Dubuis S, Blattmann P, Argmann CA, Houten SM, Amariuta T, Wolski W, Zamboni N, Aebersold R, and Auwerx J
- Subjects
- Animals, Diet, Electron Transport Complex IV genetics, Electron Transport Complex IV metabolism, Genetic Variation, Hep G2 Cells, Humans, Metabolic Networks and Pathways genetics, Metabolome, Metabolomics, Mice, Mice, Inbred Strains, Mitochondria, Liver genetics, Molecular Sequence Data, Proteome, Quantitative Trait Loci, Transcriptome, Cholesterol metabolism, Liver metabolism, Mitochondria, Liver metabolism, Proteomics
- Abstract
Recent improvements in quantitative proteomics approaches, including Sequential Window Acquisition of all Theoretical Mass Spectra (SWATH-MS), permit reproducible large-scale protein measurements across diverse cohorts. Together with genomics, transcriptomics, and other technologies, transomic data sets can be generated that permit detailed analyses across broad molecular interaction networks. Here, we examine mitochondrial links to liver metabolism through the genome, transcriptome, proteome, and metabolome of 386 individuals in the BXD mouse reference population. Several links were validated between genetic variants toward transcripts, proteins, metabolites, and phenotypes. Among these, sequence variants in Cox7a2l alter its protein's activity, which in turn leads to downstream differences in mitochondrial supercomplex formation. This data set demonstrates that the proteome can now be quantified comprehensively, serving as a key complement to transcriptomics, genomics, and metabolomics--a combination moving us forward in complex trait analysis., (Copyright © 2016, American Association for the Advancement of Science.)
- Published
- 2016
- Full Text
- View/download PDF
27. Two-dimensional IR spectroscopy of the anti-HIV agent KP1212 reveals protonated and neutral tautomers that influence pH-dependent mutagenicity.
- Author
-
Peng CS, Fedeles BI, Singh V, Li D, Amariuta T, Essigmann JM, and Tokmakoff A
- Subjects
- Base Sequence, DNA chemistry, DNA genetics, Deoxycytidine chemistry, Deoxycytidine toxicity, Hydrogen-Ion Concentration, Models, Molecular, Molecular Sequence Data, Mutagenicity Tests, Mutation genetics, Quantum Theory, Spectroscopy, Fourier Transform Infrared, Stereoisomerism, Temperature, Water chemistry, Anti-HIV Agents chemistry, Anti-HIV Agents toxicity, Deoxycytidine analogs & derivatives, Protons
- Abstract
Antiviral drugs designed to accelerate viral mutation rates can drive a viral population to extinction in a process called lethal mutagenesis. One such molecule is 5,6-dihydro-5-aza-2'-deoxycytidine (KP1212), a selective mutagen that induces A-to-G and G-to-A mutations in the genome of replicating HIV. The mutagenic property of KP1212 was hypothesized to originate from its amino-imino tautomerism, which would explain its ability to base pair with either G or A. To test the multiple tautomer hypothesis, we used 2D IR spectroscopy, which offers subpicosecond time resolution and structural sensitivity to distinguish among rapidly interconverting tautomers. We identified several KP1212 tautomers and found that >60% of neutral KP1212 is present in the enol-imino form. The abundant proportion of this traditionally rare tautomer offers a compelling structure-based mechanism for pairing with adenine. Additionally, the pKa of KP1212 was measured to be 7.0, meaning a substantial population of KP1212 is protonated at physiological pH. Furthermore, the mutagenicity of KP1212 was found to increase dramatically at pH <7, suggesting a significant biological role for the protonated KP1212 molecules. Overall, our data reveal that the bimodal mutagenic properties of KP1212 result from its unique shape shifting ability that utilizes both tautomerization and protonation.
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.