2,312 results on '"complex traits"'
Search Results
2. Distinct explanations underlie gene-environment interactions in the UK Biobank
- Author
-
Durvasula, Arun and Price, Alkes L.
- Published
- 2025
- Full Text
- View/download PDF
3. Immunological roads diverged: mapping tuberculosis outcomes in mice
- Author
-
Meade, Rachel K. and Smith, Clare M.
- Published
- 2025
- Full Text
- View/download PDF
4. Transcriptome signatures of the lipid metabolism in the liver and partial characterisation of the plasma phospholipidome of a long-distance migratory bird, the Northern Wheatear (Oenanthe oenanthe)
- Author
-
Frias-Soler, Roberto Carlos, Wellbrock, Natalie A., Bindila, Laura, Wink, Michael, and Bairlein, Franz
- Published
- 2025
- Full Text
- View/download PDF
5. Ancient Secretory Pathways Contributed to the Evolutionary Origin of an Ecologically Impactful Bioluminescence System.
- Author
-
Mesrop, Lisa, Minsky, Geetanjali, Drummond, Michael, Goodheart, Jessica, Proulx, Stephen, and Oakley, Todd
- Subjects
WGCNA ,bioluminescence ,complex traits ,evo-devo ,evolution ,novelty ,Animals ,Secretory Pathway ,Biological Evolution ,Luminescence ,Crustacea ,Evolution ,Molecular ,Luciferases - Abstract
Evolutionary innovations in chemical secretion-such as the production of secondary metabolites, pheromones, and toxins-profoundly impact ecological interactions across a broad diversity of life. These secretory innovations may involve a legacy-plus-innovation mode of evolution, whereby new biochemical pathways are integrated with conserved secretory processes to create novel products. Among secretory innovations, bioluminescence is important because it evolved convergently many times to influence predator-prey interactions, while often producing courtship signals linked to increased rates of speciation. However, whether or not deeply conserved secretory genes are used in secretory bioluminescence remains unexplored. Here, we show that in the ostracod Vargula tsujii, the evolutionary novel c-luciferase gene is co-expressed with many conserved genes, including those related to toxin production and high-output protein secretion. Our results demonstrate that the legacy-plus-innovation mode of secretory evolution, previously applied to sensory modalities of olfaction, gustation, and nociception, also encompasses light-producing signals generated by bioluminescent secretions. This extension broadens the paradigm of secretory diversification to include not only chemical signals but also bioluminescent light as an important medium of ecological interaction and evolutionary innovation.
- Published
- 2024
6. A scalable and robust variance components method reveals insights into the architecture of gene-environment interactions underlying complex traits.
- Author
-
Pazokitoroudi, Ali, Liu, Zhengtong, Dahl, Andrew, Zaitlen, Noah, Rosset, Saharon, and Sankararaman, Sriram
- Subjects
UK Biobank ,complex traits ,gene-context interaction ,gene-drug interaction ,gene-environment interaction ,genetic architecture of gene-environment interactions ,noise heterogeneity ,patitioning GxE heritability ,scalable variance component analysis ,Humans ,Gene-Environment Interaction ,Polymorphism ,Single Nucleotide ,Genome-Wide Association Study ,Multifactorial Inheritance ,Male ,Female ,Quantitative Trait ,Heritable ,Phenotype ,Models ,Genetic ,Quantitative Trait Loci - Abstract
Understanding the contribution of gene-environment interactions (GxE) to complex trait variation can provide insights into disease mechanisms, explain sources of heritability, and improve genetic risk prediction. While large biobanks with genetic and deep phenotypic data hold promise for obtaining novel insights into GxE, our understanding of GxE architecture in complex traits remains limited. We introduce a method to estimate the proportion of trait variance explained by GxE (GxE heritability) and additive genetic effects (additive heritability) across the genome and within specific genomic annotations. We show that our method is accurate in simulations and computationally efficient for biobank-scale datasets. We applied our method to common array SNPs (MAF ≥1%), fifty quantitative traits, and four environmental variables (smoking, sex, age, and statin usage) in unrelated white British individuals in the UK Biobank. We found 68 trait-E pairs with significant genome-wide GxE heritability (p
- Published
- 2024
7. Similar enzymatic functions in distinct bioluminescence systems: evolutionary recruitment of sulfotransferases in ostracod light organs.
- Author
-
Lau, Emily, Goodheart, Jessica, Anderson, Nolan, Liu, Vannie, Mukherjee, Arnab, and Oakley, Todd
- Subjects
bioluminescence ,complex traits ,convergent evolution ,gene expression ,parallel evolution ,sulfotransferase ,Animals ,Sulfotransferases ,Crustacea ,Phylogeny ,Evolution ,Molecular ,Luminescence - Abstract
Genes from ancient families are sometimes involved in the convergent evolutionary origins of similar traits, even across vast phylogenetic distances. Sulfotransferases are an ancient family of enzymes that transfer sulfate from a donor to a wide variety of substrates, including probable roles in some bioluminescence systems. Here, we demonstrate multiple sulfotransferases, highly expressed in light organs of the bioluminescent ostracod Vargula tsujii, transfer sulfate in vitro to the luciferin substrate, vargulin. We find luciferin sulfotransferases (LSTs) of ostracods are not orthologous to known LSTs of fireflies or sea pansies; animals with distinct and convergently evolved bioluminescence systems compared to ostracods. Therefore, distantly related sulfotransferases were independently recruited at least three times, leading to parallel evolution of luciferin metabolism in three highly diverged organisms. Reuse of homologous genes is surprising in these bioluminescence systems because the other components, including luciferins and luciferases, are completely distinct. Whether convergently evolved traits incorporate ancient genes with similar functions or instead use distinct, often newer, genes may be constrained by how many genetic solutions exist for a particular function. When fewer solutions exist, as in genetic sulfation of small molecules, evolution may be more constrained to use the same genes time and again.
- Published
- 2024
8. mmGEBLUP: an advanced genomic prediction scheme for genetic improvement of complex traits in crops through integrative analysis of major genes, polygenes, and genotype–environment interactions.
- Author
-
Zhang, Qi-Xin, Zhu, Tianneng, Lin, Feng, Fang, Dunhuang, Chen, Xuejun, Lou, Xiangyang, Tong, Zhijun, Xiao, Bingguang, and Xu, Hai-Ming
- Abstract
Current genomic prediction (GP) models often fall short of fully capturing the genetic architecture of complex traits and providing practical breeding guidance, particularly under varying environments. Here, we propose the mmGEBLUP, an advanced GP scheme designed to tackle the current limitations in fully exploiting the genetic architecture of complex traits and to predict individual breeding value (BV) with multi-environment trial data. Our approach considers four genetic structural indicators to capture the genetic architectures stepwise across four models: the Genomic Best Linear Unbiased Prediction (GBLUP) model considers only main polygenic effects; the GEBLUP model includes both main and genotype-by-environment (GE) interaction polygenic effects; and the mmGBLUP and mmGEBLUP models further incorporate main and GE interaction effects of major genes. Through systematic simulations and applications to nine traits, three in rice and six in tobacco, we show stepwise increases in prediction accuracy from GBLUP to mmGEBLUP, providing evidence on the scale of heritability and polygenicity of traits. In practical terms, we predict four components of BV: major additive, minor additive, major interaction, and minor interaction. Interestingly, we discover that for traits like natural leaf number in tobacco, the major additive BVs for the top 20 individuals are substantially equal; it is the minor additive BV that causes the difference in the total BV. The relative size of major/minor additive BVs suggests performing either marker-assisted selection or genomic selection or both. Overall, mmGEBLUP is an advanced prediction scheme that enhances the understanding of genetic architectures and facilitate the genetic improvement of complex traits in crops under diverse environments. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
9. Investigative power of genomic informational field theory relative to genome-wide association studies for genotype-phenotype mapping.
- Author
-
Kyratzi, Panagiota, Matika, Oswald, Brassington, Amey H., Clare, Connie E., Xu, Juan, Barrett, David A., Emes, Richard D., Archibald, Alan L., Paldi, Andras, Sinclair, Kevin D., Wattis, Jonathan, and Rauch, Cyril
- Subjects
- *
QUANTITATIVE genetics , *GENOME-wide association studies , *SHEEP , *BASE pairs , *BONE growth - Abstract
Identifying associations between phenotype and genotype is the fundamental basis of genetic analyses. Inspired by frequentist probability and the work of R. A. Fisher, genome-wide association studies (GWAS) extract information using averages and variances from genotype-phenotype datasets. Averages and variances are legitimated upon creating distribution density functions obtained through the grouping of data into categories. However, as data from within a given category cannot be differentiated, the investigative power of such methodologies is limited. Genomic informational field theory (GIFT) is a method specifically designed to circumvent this issue. The way GIFT proceeds is opposite to that of GWAS. Although GWAS determines the extent to which genes are involved in phenotype formation (bottom-up approach), GIFT determines the degree to which the phenotype can select microstates (genes) for its subsistence (top-down approach). Doing so requires dealing with new genetic concepts, a.k.a. genetic paths, upon which significance levels for genotype-phenotype associations can be determined. By using different datasets obtained in Ovis aries related to bone growth (dataset 1) and to a series of linked metabolic and epigenetic pathways (dataset 2), we demonstrate that removing the informational barrier linked to categories enhances the investigative and discriminative powers of GIFT, namely that GIFT extracts more information than GWAS. We conclude by suggesting that GIFT is an adequate tool to study how phenotypic plasticity and genetic assimilation are linked. NEW & NOTEWORTHY: The genetic basis of complex traits remains challenging to investigate using classic genome-wide association studies (GWASs). Given the success of gene editing technologies, this point needs to be addressed urgently since there can only be useful editing technologies whether precise genotype-phenotype mapping information is available initially. Genomic informational field theory (GIFT) is a new mapping method designed to increase the investigative power of biological/medical datasets suggesting, in turn, the need to rethink the conceptual bases of quantitative genetics. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. A GWAS for grip strength in cohorts of children—Advantages of analysing young participants for this trait.
- Author
-
Abbondanza, Filippo, Wang, Carol A., Schmitz, Judith, Marianski, Krzysztof, Pennell, Craig E., Whitehouse, Andrew J. O., and Paracchini, Silvia
- Subjects
- *
GENETIC risk score , *LOCUS (Genetics) , *MUSCLE strength , *GENE expression , *CORONARY artery disease , *GRIP strength - Abstract
Grip strength (GS) is a proxy measure for muscular strength and a predictor for bone fracture risk among other diseases. Previous genome‐wide association studies (GWASs) have been conducted in large cohorts of adults focusing on scores collected for the dominant hand, therefore increasing the likelihood of confounding effects by environmental factors. Here, we perform the first GWAS meta‐analyses on maximal GS with the dominant (GSD) and non‐dominant (GSND) hand in two cohorts of children (ALSPAC, N = 5450; age range = 10.65–13.61; Raine Study, N = 1162, age range: 9.42–12.38 years). We identified a novel significant association for GSND (rs9546244, LINC02465, p = 3.43e−08) and replicated associations previously reported in adults including with a HOXB3 gene marker that shows an expression quantitative trait locus (eQTL) effect. Despite a much smaller sample (~3%) compared with the UK Biobank we replicated correlation analyses previously reported in this much larger adult cohort, such as a negative correlation with coronary artery disease. Although the results from the polygenic risk score (PRS) analyses did not survive multiple testing correction, we observed nominally significant associations between GS and risk of overall fracture, as previously reported, as well ADHD which will require further investigations. Finally, we observed a higher SNP‐heritability (24%–41%) compared with previous studies (4%–24%) in adults. Overall, our results suggest that cohorts of children might be better suited for genetic studies of grip strength, possibly due to the shorter exposure to confounding environmental factors compared with adults. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Polymorphic short tandem repeats make widespread contributions to blood and serum traits
- Author
-
Margoliash, Jonathan, Fuchs, Shai, Li, Yang, Zhang, Xuan, Massarat, Arya, Goren, Alon, and Gymrek, Melissa
- Subjects
Epidemiology ,Biological Sciences ,Health Sciences ,Genetics ,Human Genome ,2.1 Biological and endogenous factors ,Generic health relevance ,GWAS ,blood traits ,complex traits ,complex variants ,fine-mapping ,microsatellites ,platelet traits ,short tandem repeats - Abstract
Short tandem repeats (STRs) are genomic regions consisting of repeated sequences of 1-6 bp in succession. Single-nucleotide polymorphism (SNP)-based genome-wide association studies (GWASs) do not fully capture STR effects. To study these effects, we imputed 445,720 STRs into genotype arrays from 408,153 White British UK Biobank participants and tested for association with 44 blood phenotypes. Using two fine-mapping methods, we identify 119 candidate causal STR-trait associations and estimate that STRs account for 5.2%-7.6% of causal variants identifiable from GWASs for these traits. These are among the strongest associations for multiple phenotypes, including a coding CTG repeat associated with apolipoprotein B levels, a promoter CGG repeat with platelet traits, and an intronic poly(A) repeat with mean platelet volume. Our study suggests that STRs make widespread contributions to complex traits, provides stringently selected candidate causal STRs, and demonstrates the need to consider a more complete view of genetic variation in GWASs.
- Published
- 2023
12. Evaluation of heritability partitioning approaches in livestock populations
- Author
-
Can Yuan, José Luis Gualdrón Duarte, Haruko Takeda, Michel Georges, and Tom Druet
- Subjects
Heritability partitioning ,Variance components ,Genetic architecture ,Functional annotation ,Complex traits ,Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Heritability partitioning approaches estimate the contribution of different functional classes, such as coding or regulatory variants, to the genetic variance. This information allows a better understanding of the genetic architecture of complex traits, including complex diseases, but can also help improve the accuracy of genomic selection in livestock species. However, methods have mainly been tested on human genomic data, whereas livestock populations have specific characteristics, such as high levels of relatedness, small effective population size or long-range levels of linkage disequilibrium. Results Here, we used data from 14,762 cows, imputed at the whole-genome sequence level for 11,537,240 variants, to simulate traits in a typical livestock population and evaluate the accuracy of two state-of-the-art heritability partitioning methods, GREML and a Bayesian mixture model. In simulations where a single functional class had increased contribution to heritability, we observed that the estimators were unbiased but had low precision. When causal variants were enriched in variants with low ( 0.20) minor allele frequency or low (below 1st quartile) or high (above 3rd quartile) linkage disequilibrium scores, it was necessary to partition the genetic variance into multiple classes defined on the basis of allele frequencies or LD scores to obtain unbiased results. When multiple functional classes had variable contributions to heritability, estimators showed higher levels of variation and confounding between certain categories was observed. In addition, estimators from small categories were particularly imprecise. However, the estimates and their ranking were still informative about the contribution of the classes. We also demonstrated that using methods that estimate the contribution of a single category at a time, a commonly used approach, results in an overestimation. Finally, we applied the methods to phenotypes for muscular development and height and estimated that, on average, variants in open chromatin regions had a higher contribution to the genetic variance (> 45%), while variants in coding regions had the strongest individual effects (> 25-fold enrichment on average). Conversely, variants in intergenic or intronic regions showed lower levels of enrichment (0.2 and 0.6-fold on average, respectively). Conclusions Heritability partitioning approaches should be used cautiously in livestock populations, in particular for small categories. Two-component approaches that fit only one functional category at a time lead to biased estimators and should not be used.
- Published
- 2024
- Full Text
- View/download PDF
13. The omics basis of human health : investigating plasma proteins and their genetic effects on complex traits
- Author
-
Repetto, Linda, Wilson, Jim, Shen, Xia, Navarro, Pau, and Haley, Chris
- Subjects
Omics ,plasma proteins ,complex traits ,pathophysiology ,complex phenotypes ,genome-wide association (GWA) ,proteomics - Abstract
Over the past decade, the advancements in technology and the growing amount of identified genetic variants have led to a high number of important discoveries in the field of precision medicine concerning human biology and pathophysiology. However, it became evident that genomics alone could not properly explain the onset and regulation of the specific molecular mechanisms of certain phenotypes. Studying omics helped complement this gap in genetic research, providing detailed information on the quantification of molecules that are involved in structural and functional processes in the organism. Specifically, protein production, levels, and regulation are dynamic and change during the course of one's lifetime. This information has proven fundamental to understanding how certain proteins affect complex phenotypes such as neurological and psychiatric disorders. In this thesis, I describe the three groups of analyses I conducted over the course of my doctoral programme on different sets of blood plasma proteins and over a broad range of neurological, psychiatric, cardiovascular, and electrophysiology phenotypes. The underlying mechanisms that trigger the onset of psychiatric and neurological conditions are often not limited to the nervous system, but rather stem from multi-system molecular triggers. The first part of the work I carried out aims at investigating the frequent co-occurrence and comorbidity of neurological and cardiovascular phenotypes by conducting a genome-wide association (GWA) meta-analysis of 183 neurology-related blood proteins on data from over 12000 individuals. The second part concerns the bivariate and multivariate analyses conducted on 276 cardiology and inflammatory proteins, while the third illustrates the contribution to consortia focussed on heart rate and electrophysiology. Results from the second and third parts of the work provided information that played an important role in understanding a part of the genetic mechanisms of the complex traits of interest. Overall, the results presented in this thesis strongly support the notion that proteomics is an important tool to be used to study complex traits and drug discovery and development should focus on targeting protein synthesis and regulation. Furthermore, the results also support the notion that complex diseases involve more than one biological system, and in order to gain a better understanding of human pathology, it is fundamental to study the causes and effects across the entire organism.
- Published
- 2023
- Full Text
- View/download PDF
14. Liver eQTL meta-analysis illuminates potential molecular mechanisms of cardiometabolic traits.
- Author
-
Broadaway, K. Alaine, Brotman, Sarah M., Rosen, Jonathan D., Currin, Kevin W., Alkhawaja, Abdalla A., Etheridge, Amy S., Wright, Fred, Gallins, Paul, Jima, Dereje, Zhou, Yi-hui, Love, Michael I., Innocenti, Federico, and Mohlke, Karen L.
- Subjects
- *
GENOME-wide association studies , *GENE expression , *CHROMATIN , *SIGNALS & signaling , *LIVER - Abstract
Understanding the molecular mechanisms of complex traits is essential for developing targeted interventions. We analyzed liver expression quantitative-trait locus (eQTL) meta-analysis data on 1,183 participants to identify conditionally distinct signals. We found 9,013 eQTL signals for 6,564 genes; 23% of eGenes had two signals, and 6% had three or more signals. We then integrated the eQTL results with data from 29 cardiometabolic genome-wide association study (GWAS) traits and identified 1,582 GWAS-eQTL colocalizations for 747 eGenes. Non-primary eQTL signals accounted for 17% of all colocalizations. Isolating signals by conditional analysis prior to coloc resulted in 37% more colocalizations than using marginal eQTL and GWAS data, highlighting the importance of signal isolation. Isolating signals also led to stronger evidence of colocalization: among 343 eQTL-GWAS signal pairs in multi-signal regions, analyses that isolated the signals of interest resulted in higher posterior probability of colocalization for 41% of tests. Leveraging allelic heterogeneity, we predicted causal effects of gene expression on liver traits for four genes. To predict functional variants and regulatory elements, we colocalized eQTL with liver chromatin accessibility QTL (caQTL) and found 391 colocalizations, including 73 with non-primary eQTL signals and 60 eQTL signals that colocalized with both a caQTL and a GWAS signal. Finally, we used publicly available massively parallel reporter assays in HepG2 to highlight 14 eQTL signals that include at least one expression-modulating variant. This multi-faceted approach to unraveling the genetic underpinnings of liver-related traits could lead to therapeutic development. Understanding molecular mechanisms of complex traits is essential for developing targeted interventions. We performed a liver eQTL meta-analysis, identifying 9,013 signals for 6,564 genes. We highlighted the value of analyzing distinct signals. Integrating eQTL with GWAS, chromatin QTL, and transcriptional reporter assays provided insights into genetic underpinnings of liver-related traits. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Application of Pan-Omics Technologies in Research on Important Economic Traits for Ruminants.
- Author
-
Gao, Zhendong, Lu, Ying, Li, Mengfei, Chong, Yuqing, Hong, Jieyun, Wu, Jiao, Wu, Dongwang, Xi, Dongmei, and Deng, Weidong
- Subjects
- *
TRANSCRIPTOMES , *EPIGENOMICS , *RUMINANTS , *GENOMICS , *GENETICS - Abstract
The economic significance of ruminants in agriculture underscores the need for advanced research methodologies to enhance their traits. This review aims to elucidate the transformative role of pan-omics technologies in ruminant research, focusing on their application in uncovering the genetic mechanisms underlying complex traits such as growth, reproduction, production performance, and rumen function. Pan-omics analysis not only helps in identifying key genes and their regulatory networks associated with important economic traits but also reveals the impact of environmental factors on trait expression. By integrating genomics, epigenomics, transcriptomics, metabolomics, and microbiomics, pan-omics enables a comprehensive analysis of the interplay between genetics and environmental factors, offering a holistic understanding of trait expression. We explore specific examples of economic traits where these technologies have been pivotal, highlighting key genes and regulatory networks identified through pan-omics approaches. Additionally, we trace the historical evolution of each omics field, detailing their progression from foundational discoveries to high-throughput platforms. This review provides a critical synthesis of recent advancements, offering new insights and practical recommendations for the application of pan-omics in the ruminant industry. The broader implications for modern animal husbandry are discussed, emphasizing the potential for these technologies to drive sustainable improvements in ruminant production systems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Parallel Evolution at the Regulatory Base-Pair Level Contributes to Mammalian Interspecific Differences in Polygenic Traits.
- Author
-
Okamoto, Alexander S and Capellini, Terence D
- Subjects
BIOLOGICAL evolution ,ERYTHROCYTES ,CONSERVED sequences (Genetics) ,BLOOD cell count ,BINDING sites - Abstract
Parallel evolution occurs when distinct lineages with similar ancestral states converge on a new phenotype. Parallel evolution has been well documented at the organ, gene pathway, and amino acid sequence level but in theory, it can also occur at individual nucleotides within noncoding regions. To examine the role of parallel evolution in shaping the biology of mammalian complex traits, we used data on single-nucleotide polymorphisms (SNPs) influencing human intraspecific variation to predict trait values in other species for 11 complex traits. We found that the alleles at SNP positions associated with human intraspecific height and red blood cell (RBC) count variation are associated with interspecific variation in the corresponding traits across mammals. These associations hold for deeper branches of mammalian evolution as well as between strains of collaborative cross mice. While variation in RBC count between primates uses both ancient and more recently evolved genomic regions, we found that only primate-specific elements were correlated with primate body size. We show that the SNP positions driving these signals are flanked by conserved sequences, maintain synteny with target genes, and overlap transcription factor binding sites. This work highlights the potential of conserved but tunable regulatory elements to be reused in parallel to facilitate evolutionary adaptation in mammals. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Evaluation of heritability partitioning approaches in livestock populations.
- Author
-
Yuan, Can, Gualdrón Duarte, José Luis, Takeda, Haruko, Georges, Michel, and Druet, Tom
- Subjects
HERITABILITY ,WHOLE genome sequencing ,LINKAGE disequilibrium ,LIVESTOCK ,GENE frequency ,PHENOTYPES - Abstract
Background: Heritability partitioning approaches estimate the contribution of different functional classes, such as coding or regulatory variants, to the genetic variance. This information allows a better understanding of the genetic architecture of complex traits, including complex diseases, but can also help improve the accuracy of genomic selection in livestock species. However, methods have mainly been tested on human genomic data, whereas livestock populations have specific characteristics, such as high levels of relatedness, small effective population size or long-range levels of linkage disequilibrium. Results: Here, we used data from 14,762 cows, imputed at the whole-genome sequence level for 11,537,240 variants, to simulate traits in a typical livestock population and evaluate the accuracy of two state-of-the-art heritability partitioning methods, GREML and a Bayesian mixture model. In simulations where a single functional class had increased contribution to heritability, we observed that the estimators were unbiased but had low precision. When causal variants were enriched in variants with low (< 0.05) or high (> 0.20) minor allele frequency or low (below 1st quartile) or high (above 3rd quartile) linkage disequilibrium scores, it was necessary to partition the genetic variance into multiple classes defined on the basis of allele frequencies or LD scores to obtain unbiased results. When multiple functional classes had variable contributions to heritability, estimators showed higher levels of variation and confounding between certain categories was observed. In addition, estimators from small categories were particularly imprecise. However, the estimates and their ranking were still informative about the contribution of the classes. We also demonstrated that using methods that estimate the contribution of a single category at a time, a commonly used approach, results in an overestimation. Finally, we applied the methods to phenotypes for muscular development and height and estimated that, on average, variants in open chromatin regions had a higher contribution to the genetic variance (> 45%), while variants in coding regions had the strongest individual effects (> 25-fold enrichment on average). Conversely, variants in intergenic or intronic regions showed lower levels of enrichment (0.2 and 0.6-fold on average, respectively). Conclusions: Heritability partitioning approaches should be used cautiously in livestock populations, in particular for small categories. Two-component approaches that fit only one functional category at a time lead to biased estimators and should not be used. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. 棉籽油性状全基因组关联分析研究进展.
- Author
-
刘文豪, 余渝, 吴珂, 刘丽, 李航, and 孔宪辉
- Abstract
Cottonseed oil, soft in taste and golden in color, plays a crucial role in various foods. In addition, cottonseed oil has attracted much attention due to its unique fatty acid properties, anti-inflammatory and heart protective properties. Therefore, it is of great significance to understand the genetic mechanism that regulates the biosynthesis of cottonseed storage oil, carry out key gene mining, and select high-quality cotton varieties with high oil content. With the rapid development of high-throughput sequencing technology and bioinformatics analysis tools, whole genome sequencing has become simpler and more efficient, genome-wide association analysis (GWAS) has become a key tool for studying complex traits. This paper introduces the principle and research advantages of GWAS, discusses the progress of its application in the research of different oil crops and cotton characters. Some suggestions for future genetic studies of cottonseed oil related traits were put forward. The existing problems were discussed and prospected in order to provide a theoretical basis for future studies on the genetic basis of cottonseed oil traits and marker-assisted breeding by GWAS. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Analysis of Evolutionary Conservation, Expression Level, and Genetic Association at a Genome-wide Scale Reveals Heterogeneity Across Polygenic Phenotypes.
- Author
-
Giel, Ann-Sophie, Bigge, Jessica, Schumacher, Johannes, Maj, Carlo, and Dasmeh, Pouria
- Subjects
GENE expression ,NATURAL selection ,GENETIC correlations ,CORONARY artery disease ,GENOME-wide association studies - Abstract
Understanding the expression level and evolutionary rate of associated genes with human polygenic diseases provides crucial insights into their disease-contributing roles. In this work, we leveraged genome-wide association studies (GWASs) to investigate the relationship between the genetic association and both the evolutionary rate (d N /d S) and expression level of human genes associated with the two polygenic diseases of schizophrenia and coronary artery disease. Our findings highlight a distinct variation in these relationships between the two diseases. Genes associated with both diseases exhibit a significantly greater variance in evolutionary rate compared to those implicated in monogenic diseases. Expanding our analyses to 4,756 complex traits in the GWAS atlas database, we unraveled distinct trait categories with a unique interplay among the evolutionary rate, expression level, and genetic association of human genes. In most polygenic traits, highly expressed genes were more associated with the polygenic phenotypes compared to lowly expressed genes. About 69% of polygenic traits displayed a negative correlation between genetic association and evolutionary rate, while approximately 30% of these traits showed a positive correlation between genetic association and evolutionary rate. Our results demonstrate the presence of a spectrum among complex traits, shaped by natural selection. Notably, at opposite ends of this spectrum, we find metabolic traits being more likely influenced by purifying selection, and immunological traits that are more likely shaped by positive selection. We further established the polygenic evolution portal (evopolygen.de) as a resource for investigating relationships and generating hypotheses in the field of human polygenic trait evolution. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Mutation and selection processes regulating short tandem repeats give rise to genetic and phenotypic diversity across species
- Author
-
Verbiest, Max, Maksimov, Mikhail, Jin, Ye, Anisimova, Maria, Gymrek, Melissa, and Sonay, Tugce Bilgin
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Evolutionary Biology ,Genetics ,Human Genome ,Generic health relevance ,Mutation ,Genome ,Genotype ,Phenotype ,Microsatellite Repeats ,short tandem repeats ,microsatellites ,DNA repair ,selection ,complex traits ,evolution ,Ecology ,Zoology ,Evolutionary biology - Abstract
Short tandem repeats (STRs) are units of 1-6 bp that repeat in a tandem fashion in DNA. Along with single nucleotide polymorphisms and large structural variations, they are among the major genomic variants underlying genetic, and likely phenotypic, divergence. STRs experience mutation rates that are orders of magnitude higher than other well-studied genotypic variants. Frequent copy number changes result in a wide range of alleles, and provide unique opportunities for modulating complex phenotypes through variation in repeat length. While classical studies have identified key roles of individual STR loci, the advent of improved sequencing technology, high-quality genome assemblies for diverse species, and bioinformatics methods for genome-wide STR analysis now enable more systematic study of STR variation across wide evolutionary ranges. In this review, we explore mutation and selection processes that affect STR copy number evolution, and how these processes give rise to varying STR patterns both within and across species. Finally, we review recent examples of functional and adaptive changes linked to STRs.
- Published
- 2023
21. Multiple solutions at the genomic level in response to selective breeding for high locomotor activity.
- Author
-
Hillis, David and Garland, Theodore
- Subjects
complex traits ,experimental evolution ,gene of major effect ,mammalian genetics ,multiple solutions ,polygenic adaptation ,Mice ,Animals ,Selective Breeding ,Body Size ,Genomics ,Locomotion - Abstract
Replicate lines under uniform selection often evolve in different ways. Previously, analyses using whole-genome sequence data for individual mice (Mus musculus) from 4 replicate High Runner lines and 4 nonselected control lines demonstrated genomic regions that have responded consistently to selection for voluntary wheel-running behavior. Here, we ask whether the High Runner lines have evolved differently from each other, even though they reached selection limits at similar levels. We focus on 1 High Runner line (HR3) that became fixed for a mutation at a gene of major effect (Myh4Minimsc) that, in the homozygous condition, causes a 50% reduction in hindlimb muscle mass and many pleiotropic effects. We excluded HR3 from SNP analyses and identified 19 regions not consistently identified in analyses with all 4 lines. Repeating analyses while dropping each of the other High Runner lines identified 12, 8, and 6 such regions. (Of these 45 regions, 37 were unique.) These results suggest that each High Runner line indeed responded to selection somewhat uniquely, but also that HR3 is the most distinct. We then applied 2 additional analytical approaches when dropping HR3 only (based on haplotypes and nonstatistical tests involving fixation patterns). All 3 approaches identified 7 new regions (as compared with analyses using all 4 High Runner lines) that include genes associated with activity levels, dopamine signaling, hippocampus morphology, heart size, and body size, all of which differ between High Runner and control lines. Our results illustrate how multiple solutions and private alleles can obscure general signatures of selection involving public alleles.
- Published
- 2023
22. Systems genetics approaches for understanding complex traits with relevance for human disease
- Author
-
Allayee, Hooman, Farber, Charles R, Seldin, Marcus M, Williams, Evan Graehl, James, David E, and Lusis, Aldons J
- Subjects
Biological Sciences ,Genetics ,2.1 Biological and endogenous factors ,Generic health relevance ,Humans ,Multifactorial Inheritance ,Systems Biology ,Phenotype ,systems genetics ,complex traits ,omics ,mouse models ,human populations ,computational biology ,genetics ,genomics ,systems biology ,Biochemistry and Cell Biology ,Biological sciences ,Biomedical and clinical sciences ,Health sciences - Abstract
Quantitative traits are often complex because of the contribution of many loci, with further complexity added by environmental factors. In medical research, systems genetics is a powerful approach for the study of complex traits, as it integrates intermediate phenotypes, such as RNA, protein, and metabolite levels, to understand molecular and physiological phenotypes linking discrete DNA sequence variation to complex clinical and physiological traits. The primary purpose of this review is to describe some of the resources and tools of systems genetics in humans and rodent models, so that researchers in many areas of biology and medicine can make use of the data.
- Published
- 2023
23. Breaking down causes, consequences, and mediating effects of telomere length variation on human health
- Author
-
Samuel Moix, Marie C Sadler, Zoltán Kutalik, and Chiara Auwerx
- Subjects
Telomeres ,Mendelian randomization ,UK Biobank ,Complex traits ,Lifespan ,Aging ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract Background Telomeres form repeated DNA sequences at the ends of chromosomes, which shorten with each cell division. Yet, factors modulating telomere attrition and the health consequences thereof are not fully understood. To address this, we leveraged data from 326,363 unrelated UK Biobank participants of European ancestry. Results Using linear regression and bidirectional univariable and multivariable Mendelian randomization (MR), we elucidate the relationships between leukocyte telomere length (LTL) and 142 complex traits, including diseases, biomarkers, and lifestyle factors. We confirm that telomeres shorten with age and show a stronger decline in males than in females, with these factors contributing to the majority of the 5.4% of LTL variance explained by the phenome. MR reveals 23 traits modulating LTL. Smoking cessation and high educational attainment associate with longer LTL, while weekly alcohol intake, body mass index, urate levels, and female reproductive events, such as childbirth, associate with shorter LTL. We also identify 24 traits affected by LTL, with risk for cardiovascular, pulmonary, and some autoimmune diseases being increased by short LTL, while longer LTL increased risk for other autoimmune conditions and cancers. Through multivariable MR, we show that LTL may partially mediate the impact of educational attainment, body mass index, and female age at childbirth on proxied lifespan. Conclusions Our study sheds light on the modulators, consequences, and the mediatory role of telomeres, portraying an intricate relationship between LTL, diseases, lifestyle, and socio-economic factors.
- Published
- 2024
- Full Text
- View/download PDF
24. Exploring the Interplay between the Hologenome and Complex Traits in Bovine and Porcine Animals Using Genome-Wide Association Analysis.
- Author
-
Qadri, Qamar Raza, Lai, Xueshuang, Zhao, Wei, Zhang, Zhenyang, Zhao, Qingbo, Ma, Peipei, Pan, Yuchun, and Wang, Qishan
- Subjects
- *
GENOME-wide association studies , *GUT microbiome , *LOCUS (Genetics) , *CATTLE physiology , *BONFERRONI correction , *MILK yield , *BOS - Abstract
Genome-wide association studies (GWAS) significantly enhance our ability to identify trait-associated genomic variants by considering the host genome. Moreover, the hologenome refers to the host organism's collective genetic material and its associated microbiome. In this study, we utilized the hologenome framework, called Hologenome-wide association studies (HWAS), to dissect the architecture of complex traits, including milk yield, methane emissions, rumen physiology in cattle, and gut microbial composition in pigs. We employed four statistical models: (1) GWAS, (2) Microbial GWAS (M-GWAS), (3) HWAS-CG (hologenome interaction estimated using COvariance between Random Effects Genome-based restricted maximum likelihood (CORE-GREML)), and (4) HWAS-H (hologenome interaction estimated using the Hadamard product method). We applied Bonferroni correction to interpret the significant associations in the complex traits. The GWAS and M-GWAS detected one and sixteen significant SNPs for milk yield traits, respectively, whereas the HWAS-CG and HWAS-H each identified eight SNPs. Moreover, HWAS-CG revealed four, and the remaining models identified three SNPs each for methane emissions traits. The GWAS and HWAS-CG detected one and three SNPs for rumen physiology traits, respectively. For the pigs' gut microbial composition traits, the GWAS, M-GWAS, HWAS-CG, and HWAS-H identified 14, 16, 13, and 12 SNPs, respectively. We further explored these associations through SNP annotation and by analyzing biological processes and functional pathways. Additionally, we integrated our GWA results with expression quantitative trait locus (eQTL) data using transcriptome-wide association studies (TWAS) and summary-based Mendelian randomization (SMR) methods for a more comprehensive understanding of SNP-trait associations. Our study revealed hologenomic variability in agriculturally important traits, enhancing our understanding of host-microbiome interactions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Naturally segregating genetic variants contribute to thermal tolerance in a Drosophila melanogaster model system.
- Author
-
Williams-Simon, Patricka A, Oster, Camille, Moaton, Jordyn A, Ghidey, Ronel, Ng'oma, Enoch, Middleton, Kevin M, and King, Elizabeth G
- Subjects
- *
BIOLOGICAL models , *RESEARCH funding , *DESCRIPTIVE statistics , *QUANTITATIVE research , *GENETIC variation , *HEAT , *RNA , *ANIMAL experimentation , *GENE expression profiling , *INSECTS , *GENOMES , *SEQUENCE analysis - Abstract
Thermal tolerance is a fundamental physiological complex trait for survival in many species. For example, everyday tasks such as foraging, finding a mate, and avoiding predation are highly dependent on how well an organism can tolerate extreme temperatures. Understanding the general architecture of the natural variants within the genes that control this trait is of high importance if we want to better comprehend thermal physiology. Here, we take a multipronged approach to further dissect the genetic architecture that controls thermal tolerance in natural populations using the Drosophila Synthetic Population Resource as a model system. First, we used quantitative genetics and Quantitative Trait Loci mapping to identify major effect regions within the genome that influences thermal tolerance, then integrated RNA-sequencing to identify differences in gene expression, and lastly, we used the RNAi system to (1) alter tissue-specific gene expression and (2) functionally validate our findings. This powerful integration of approaches not only allows for the identification of the genetic basis of thermal tolerance but also the physiology of thermal tolerance in a natural population, which ultimately elucidates thermal tolerance through a fitness-associated lens. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Estimating disease heritability from complex pedigrees allowing for ascertainment and covariates.
- Author
-
Speed, Doug and Evans, David M.
- Subjects
- *
HERITABILITY , *NOSOLOGY , *GENEALOGY , *PHENOTYPES ,INTERNATIONAL Statistical Classification of Diseases & Related Health Problems - Abstract
We propose TetraHer, a method for estimating the liability heritability of binary phenotypes. TetraHer has five key features. First, it can be applied to data from complex pedigrees that contain multiple types of relationships. Second, it can correct for ascertainment of cases or controls. Third, it can accommodate covariates. Fourth, it can model the contribution of common environment. Fifth, it produces a likelihood that can be used for significance testing. We first demonstrate the validity of TetraHer on simulated data. We then use TetraHer to estimate liability heritability for 229 codes from the tenth International Classification of Diseases (ICD-10). We identify 107 codes with significant heritability (p < 0.05/229), which can be used in future analyses for investigating the genetic architecture of human diseases. TetraHer is a new method for estimating the heritability of diseases that allows for ascertainment of cases and for contributions of covariates. Applied to UK Biobank data, TetraHer finds 107 ICD-10 codes with significant heritability. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Trait selection strategy in multi-trait GWAS: Boosting SNP discoverability
- Author
-
Yuka Suzuki, Hervé Ménager, Bryan Brancotte, Raphaël Vernet, Cyril Nerin, Christophe Boetto, Antoine Auvergne, Christophe Linhard, Rachel Torchet, Pierre Lechat, Lucie Troubat, Michael H. Cho, Emmanuelle Bouzigon, Hugues Aschard, and Hanna Julienne
- Subjects
Complex traits ,Variant mapping ,Multi-trait GWAS ,Statistical power ,Genetic Architecture ,Genetics ,QH426-470 - Abstract
Summary: Since the first genome-wide association studies (GWASs), thousands of variant-trait associations have been discovered. However, comprehensively mapping the genetic determinant of complex traits through univariate testing can require prohibitive sample sizes. Multi-trait GWAS can circumvent this issue and improve statistical power by leveraging the joint genetic architecture of human phenotypes. Although many methodological hurdles of multi-trait testing have been solved, the strategy to select traits has been overlooked. In this study, we conducted multi-trait GWAS on approximately 20,000 combinations of 72 traits using an omnibus test as implemented in the Joint Analysis of Summary Statistics. We assessed which genetic features of the sets of traits analyzed were associated with an increased detection of variants compared with univariate screening. Several features of the set of traits, including the heritability, the number of traits, and the genetic correlation, drive the multi-trait test gain. Using these features jointly in predictive models captures a large fraction of the power gain of the multi-trait test (Pearson’s r between the observed and predicted gain equals 0.43, p
- Published
- 2024
- Full Text
- View/download PDF
28. A multi-layer functional genomic analysis to understand noncoding genetic variation in lipids
- Author
-
Ramdas, Shweta, Judd, Jonathan, Graham, Sarah E, Kanoni, Stavroula, Wang, Yuxuan, Surakka, Ida, Wenz, Brandon, Clarke, Shoa L, Chesi, Alessandra, Wells, Andrew, Bhatti, Konain Fatima, Vedantam, Sailaja, Winkler, Thomas W, Locke, Adam E, Marouli, Eirini, Zajac, Greg JM, Wu, Kuan-Han H, Ntalla, Ioanna, Hui, Qin, Klarin, Derek, Hilliard, Austin T, Wang, Zeyuan, Xue, Chao, Thorleifsson, Gudmar, Helgadottir, Anna, Gudbjartsson, Daniel F, Holm, Hilma, Olafsson, Isleifur, Hwang, Mi Yeong, Han, Sohee, Akiyama, Masato, Sakaue, Saori, Terao, Chikashi, Kanai, Masahiro, Zhou, Wei, Brumpton, Ben M, Rasheed, Humaira, Havulinna, Aki S, Veturi, Yogasudha, Pacheco, Jennifer Allen, Rosenthal, Elisabeth A, Lingren, Todd, Feng, QiPing, Kullo, Iftikhar J, Narita, Akira, Takayama, Jun, Martin, Hilary C, Hunt, Karen A, Trivedi, Bhavi, Haessler, Jeffrey, Giulianini, Franco, Bradford, Yuki, Miller, Jason E, Campbell, Archie, Lin, Kuang, Millwood, Iona Y, Rasheed, Asif, Hindy, George, Faul, Jessica D, Zhao, Wei, Weir, David R, Turman, Constance, Huang, Hongyan, Graff, Mariaelisa, Choudhury, Ananyo, Sengupta, Dhriti, Mahajan, Anubha, Brown, Michael R, Zhang, Weihua, Yu, Ketian, Schmidt, Ellen M, Pandit, Anita, Gustafsson, Stefan, Yin, Xianyong, Luan, Jian’an, Zhao, Jing-Hua, Matsuda, Fumihiko, Jang, Hye-Mi, Yoon, Kyungheon, Medina-Gomez, Carolina, Pitsillides, Achilleas, Hottenga, Jouke Jan, Wood, Andrew R, Ji, Yingji, Gao, Zishan, Haworth, Simon, Mitchell, Ruth E, Chai, Jin Fang, Aadahl, Mette, Bjerregaard, Anne A, Yao, Jie, Manichaikul, Ani, Lee, Wen-Jane, Hsiung, Chao Agnes, Warren, Helen R, Ramirez, Julia, Bork-Jensen, Jette, Kårhus, Line L, Goel, Anuj, and Sabater-Lleal, Maria
- Subjects
Human Genome ,Biotechnology ,Genetics ,2.1 Biological and endogenous factors ,Aetiology ,Chromatin ,Genome-Wide Association Study ,Genomics ,Humans ,Lipids ,Polymorphism ,Single Nucleotide ,Million Veterans Program ,Global Lipids Genetics Consortium ,complex traits ,fine-mapping ,functional genomics ,lipid biology ,post-GWAS ,regulatory mechanism ,variant prioritization ,Biological Sciences ,Medical and Health Sciences ,Genetics & Heredity - Abstract
A major challenge of genome-wide association studies (GWASs) is to translate phenotypic associations into biological insights. Here, we integrate a large GWAS on blood lipids involving 1.6 million individuals from five ancestries with a wide array of functional genomic datasets to discover regulatory mechanisms underlying lipid associations. We first prioritize lipid-associated genes with expression quantitative trait locus (eQTL) colocalizations and then add chromatin interaction data to narrow the search for functional genes. Polygenic enrichment analysis across 697 annotations from a host of tissues and cell types confirms the central role of the liver in lipid levels and highlights the selective enrichment of adipose-specific chromatin marks in high-density lipoprotein cholesterol and triglycerides. Overlapping transcription factor (TF) binding sites with lipid-associated loci identifies TFs relevant in lipid biology. In addition, we present an integrative framework to prioritize causal variants at GWAS loci, producing a comprehensive list of candidate causal genes and variants with multiple layers of functional evidence. We highlight two of the prioritized genes, CREBRF and RRBP1, which show convergent evidence across functional datasets supporting their roles in lipid biology.
- Published
- 2022
29. Polygenic risk scores of endo-phenotypes identify the effect of genetic background in congenital heart disease
- Author
-
Spendlove, Sarah J, Bondhus, Leroy, Lluri, Gentian, Sul, Jae Hoon, and Arboleda, Valerie A
- Subjects
Biological Sciences ,Genetics ,Congenital Heart Disease ,Pediatric ,Rare Diseases ,Congenital Structural Anomalies ,Cardiovascular ,Human Genome ,Heart Disease ,2.1 Biological and endogenous factors ,Good Health and Well Being ,complex traits ,congenital heart disease ,polygenic risk score ,rare disease - Abstract
Congenital heart disease (CHD) is a rare structural defect that occurs in ∼1% of live births. Studies on CHD genetic architecture have identified pathogenic single-gene mutations in less than 30% of cases. Single-gene mutations often show incomplete penetrance and variable expressivity. Therefore, we hypothesize that genetic background may play a role in modulating disease expression. Polygenic risk scores (PRSs) aggregate effects of common genetic variants to investigate whether, cumulatively, these variants are associated with disease penetrance or severity. However, the major limitations in this field have been in generating sufficient sample sizes for these studies. Here we used CHD-phenotype matched genome-wide association study (GWAS) summary statistics from the UK Biobank (UKBB) as our base study and whole-genome sequencing data from the CHD cohort (n1 = 711 trios, n2 = 362 European trios) of the Gabriella Miller Kids First dataset as our target study to develop PRSs for CHD. PRSs estimated using a GWAS for heart valve problems and heart murmur explain 2.5% of the variance in case-control status of CHD (all SNVs, p = 7.90 × 10-3; fetal cardiac SNVs, p = 8.00 × 10-3) and 1.8% of the variance in severity of CHD (fetal cardiac SNVs, p = 6.20 × 10-3; all SNVs, p = 0.015). These results show that common variants captured in CHD phenotype-matched GWASs have a modest but significant contribution to phenotypic expression of CHD. Further exploration of the cumulative effect of common variants is necessary for understanding the complex genetic etiology of CHD and other rare diseases.
- Published
- 2022
30. Genetic interactions drive heterogeneity in causal variant effect sizes for gene expression and complex traits
- Author
-
Patel, Roshni A, Musharoff, Shaila A, Spence, Jeffrey P, Pimentel, Harold, Tcheandjieu, Catherine, Mostafavi, Hakhamanesh, Sinnott-Armstrong, Nasa, Clarke, Shoa L, Smith, Courtney J, Program, VA Million Veteran, Durda, Peter P, Taylor, Kent D, Tracy, Russell, Liu, Yongmei, Johnson, W Craig, Aguet, Francois, Ardlie, Kristin G, Gabriel, Stacey, Smith, Josh, Nickerson, Deborah A, Rich, Stephen S, Rotter, Jerome I, Tsao, Philip S, Assimes, Themistocles L, and Pritchard, Jonathan K
- Subjects
Epidemiology ,Biological Sciences ,Health Sciences ,Genetics ,Atherosclerosis ,Human Genome ,2.1 Biological and endogenous factors ,Aetiology ,Cholesterol ,LDL ,Gene Expression ,Genome-Wide Association Study ,Humans ,Multifactorial Inheritance ,Polymorphism ,Single Nucleotide ,White People ,V.A. Million Veteran Program ,complex traits ,genetic correlation ,genome-wide association ,population genetics ,statistical genetics ,Medical and Health Sciences ,Genetics & Heredity ,Biological sciences ,Biomedical and clinical sciences ,Health sciences - Abstract
Despite the growing number of genome-wide association studies (GWASs), it remains unclear to what extent gene-by-gene and gene-by-environment interactions influence complex traits in humans. The magnitude of genetic interactions in complex traits has been difficult to quantify because GWASs are generally underpowered to detect individual interactions of small effect. Here, we develop a method to test for genetic interactions that aggregates information across all trait-associated loci. Specifically, we test whether SNPs in regions of European ancestry shared between European American and admixed African American individuals have the same causal effect sizes. We hypothesize that in African Americans, the presence of genetic interactions will drive the causal effect sizes of SNPs in regions of European ancestry to be more similar to those of SNPs in regions of African ancestry. We apply our method to two traits: gene expression in 296 African Americans and 482 European Americans in the Multi-Ethnic Study of Atherosclerosis (MESA) and low-density lipoprotein cholesterol (LDL-C) in 74K African Americans and 296K European Americans in the Million Veteran Program (MVP). We find significant evidence for genetic interactions in our analysis of gene expression; for LDL-C, we observe a similar point estimate, although this is not significant, most likely due to lower statistical power. These results suggest that gene-by-gene or gene-by-environment interactions modify the effect sizes of causal variants in human complex traits.
- Published
- 2022
31. Common genetic variation and spliceosome variants in rare developmental disorders
- Author
-
Wigdor, Emilie and Martin, Hilary
- Subjects
cis-regulatory variants ,common genetic variation ,complex traits ,eQTL ,genetics ,incomplete penetrance ,modified penetrance ,polygenic scores ,rare disease ,rare disorders ,RNA-sequencing ,spliceosome ,splicing ,TWAS - Abstract
Although thousands of rare disorders are caused by single, deleterious, protein- coding variants, evidence suggests that common variants also contribute to risk for rare, neurodevelopmental disorders (NDDs). These are likely affecting the penetrance of protein-coding variants as well as expressivity, posing a major challenge in the interpretation of rare variants. An additional challenge is our incomplete understanding of which variants are likely to affect gene function. Due to the high burden of "variants of unknown significance" (VUS), there is a great need to develop molecular biomarkers of individual disorders which could be used as an intermediate phenotype to help determine whether a VUS is pathogenic or benign. For disorders which are due to mutations in spliceosomal components, global patterns of splicing changes may be a useful biomarker. The research presented falls into three main projects. First, I investigated via genetically-predicted gene expression, whether cis-regulatory variants modify the penetrance of inherited, putatively damaging variants in NDD probands in the Deciphering Developmental Disorders (DDD) Study. To determine whether there were overall differences in predicted gene expression between probands and controls, I conducted a Transcriptome Association Study. I then tested whether the predicted gene expression of genes harbouring inherited, putatively damaging variants, is lower in undiagnosed NDD probands compared to controls. Finally, I investigated the modified penetrance of inherited, putatively damaging variants by comparing predicted gene expression between undiagnosed NDD probands and their unaffected, variant-transmitting parents. Second, I further explored the role of common variants in severe NDDs using polygenic scores (PGS) in both DDD and the Genomics England (GEL) 100,000 Genomes project. I tested whether undiagnosed NDD probands over- or under- inherit PGS for NDDs and correlated traits. I found that NDD probands over-inherit PGS for NDDs and schizophrenia. To put these results into context, I compared unaffected parents of undiagnosed probands' PGS to both controls and probands. I found that parents' PGS are significantly different from controls' PGS, but not from probands. Additionally, I explored sex differences in PGS, by examining both affected and unaffected individuals. I found preliminary evidence of a female protective effect in the context of common variation. Finally, I revisited the question of the modified penetrance of inherited, putatively damaging variants. Third, using whole genome sequencing and bulk RNA sequencing of whole blood from GEL, I investigated differential splicing and gene expression in rare disorder probands with a pathogenic variant in the spliceosome. I found enrichment of differentially expressed genes for processes related to genes containing minor introns. Additionally, I found enrichment of genes involved in spliceosomal components among differentially spliced genes, suggesting a potential feedback loop for regulation of splicing. These studies emphasise the importance of studying the convergence of common and rare variation, as well as the integration of functional data, in the context of rare disease genetics. Moreover, they highlight the need to collect phenotypic and genotypic data on parents and family members of rare disorder probands.
- Published
- 2022
- Full Text
- View/download PDF
32. Co-expression pan-network reveals genes involved in complex traits within maize pan-genome
- Author
-
Cagirici, H Busra, Andorf, Carson M, and Sen, Taner Z
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Pediatric Research Initiative ,Biotechnology ,Aetiology ,2.1 Biological and endogenous factors ,Zea mays ,Genome-Wide Association Study ,Multifactorial Inheritance ,Phenotype ,Gene Regulatory Networks ,Polymorphism ,Single Nucleotide ,Co-expression network ,Pan-network ,Maize ,Pan-genome ,GWAS ,Complex traits ,Tassel branch number ,Starch ,Microbiology ,Plant Biology ,Crop and Pasture Production ,Plant Biology & Botany ,Crop and pasture production ,Plant biology - Abstract
BackgroundWith the advances in the high throughput next generation sequencing technologies, genome-wide association studies (GWAS) have identified a large set of variants associated with complex phenotypic traits at a very fine scale. Despite the progress in GWAS, identification of genotype-phenotype relationship remains challenging in maize due to its nature with dozens of variants controlling the same trait. As the causal variations results in the change in expression, gene expression analyses carry a pivotal role in unraveling the transcriptional regulatory mechanisms behind the phenotypes.ResultsTo address these challenges, we incorporated the gene expression and GWAS-driven traits to extend the knowledge of genotype-phenotype relationships and transcriptional regulatory mechanisms behind the phenotypes. We constructed a large collection of gene co-expression networks and identified more than 2 million co-expressing gene pairs in the GWAS-driven pan-network which contains all the gene-pairs in individual genomes of the nested association mapping (NAM) population. We defined four sub-categories for the pan-network: (1) core-network contains the highest represented ~ 1% of the gene-pairs, (2) near-core network contains the next highest represented 1-5% of the gene-pairs, (3) private-network contains ~ 50% of the gene pairs that are unique to individual genomes, and (4) the dispensable-network contains the remaining 50-95% of the gene-pairs in the maize pan-genome. Strikingly, the private-network contained almost all the genes in the pan-network but lacked half of the interactions. We performed gene ontology (GO) enrichment analysis for the pan-, core-, and private- networks and compared the contributions of variants overlapping with genes and promoters to the GWAS-driven pan-network.ConclusionsGene co-expression networks revealed meaningful information about groups of co-regulated genes that play a central role in regulatory processes. Pan-network approach enabled us to visualize the global view of the gene regulatory network for the studied system that could not be well inferred by the core-network alone.
- Published
- 2022
33. A Genetic Analysis of Current Medication Use in the UK Biobank.
- Author
-
Rohde, Palle Duun
- Subjects
- *
DRUG side effects , *GENOME-wide association studies , *GENETIC variation , *DRUGS , *GENE frequency - Abstract
Genomics has been forecasted to revolutionise human health by improving medical treatment through a better understanding of the molecular mechanisms of human diseases. Despite great successes of the last decade's genome-wide association studies (GWAS), the results have been translated to genomic medicine to a limited extent. One route to get closer to improved medical treatment could be by understanding the genetics of medication use. Current medication profiles from 335,744 individuals from the UK Biobank were obtained, and a GWAS was conducted to identify common genetic variants associated with current medication use. In total, 59 independent loci were identified for medication use, and approximately 18% of the total variation was attributable to common genetic variation. The largest fraction of genetic variance for current medication use was captured by variants with low-to-medium minor allele frequency, with coding, conserved genomic regions and transcription start sites being enriched for associated variants. The average correlation (R) between medication use and the polygenic score was 0.14. The results further demonstrated that individuals with higher polygenic burden for medication use were, on average, sicker and had a higher risk for adverse drug reactions. These results provide an insight into the genetic contribution of medication use and pave the way for developments of novel multiple trait polygenic scores, which include the genetically informed medication use. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. A network‐based approach to understanding gene–biological processes affecting economically important traits of Nelore cattle.
- Author
-
Corrêa, Mariely Simone Lopes, Silva, Evandro Neves, dos Santos, Thaís Cristina Ferreira, Simielli Fonseca, Larissa Fernanda, Magalhães, Ana Fabrícia Braga, Verardo, Lucas Lima, de Albuquerque, Lucia Galvão, and Silva, Danielly Beraldo dos Santos
- Subjects
- *
MEAT quality , *STRIATED muscle , *BIOLOGICAL networks , *CATTLE , *MUSCLE growth , *GENE regulatory networks , *CATTLE carcasses , *LIPID metabolism - Abstract
This study aimed to build gene–biological process networks with differentially expressed genes associated with economically important traits of Nelore cattle from 17 previous studies. The genes were clustered into three groups by evaluated traits: group 1, production traits; group 2, carcass traits; and group 3, meat quality traits. For each group, a gene–biological process network analysis was performed with the differentially expressed genes in common. For production traits, 37 genes were found in common, of which 13 genes were enriched for six Gene Ontology (GO) terms; these terms were not functionally grouped. However, the enriched GO terms were related to homeostasis, the development of muscles and the immune system. For carcass traits, four genes were found in common. Thus, it was not possible to functionally group these genes into a network. For meat quality traits, the analysis revealed 222 genes in common. CSRP3 was the only gene differentially expressed in all three groups. Non‐redundant biological terms for clusters of genes were functionally grouped networks, reflecting the cross‐talk between all biological processes and genes involved. Many biological processes and pathways related to muscles, the immune system and lipid metabolism were enriched, such as striated muscle cell development and triglyceride metabolic processes. This study provides insights into the genetic mechanisms of production, carcass and meat quality traits of Nelore cattle. This information is fundamental for a better understanding of the complex traits and could help in planning strategies for the production and selection systems of Nelore cattle. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Editorial: Advances in statistical methods for the genetic dissection of complex traits in plants.
- Author
-
Yuan-Ming Zhang, Zhenyu Jia, Shang-Qian Xie, Jia Wen, Shibo Wang, and Ya-Wen Zhang
- Subjects
GENOME-wide association studies ,DISSECTION - Published
- 2024
- Full Text
- View/download PDF
36. QTL mapping of alkaloids in tobacco.
- Author
-
LIU Ying-Chao, FANG Dun-Huang, XU Hai-Ming, TONG Zhi-Jun, and XIAO Bing-Guang
- Abstract
Alkaloids are important chemical components in tobacco. In order to understand the genetic architecture of alkaloids in tobacco and identify major effect loci controlling alkaloids related traits, QTL mapping on tobacco alkaloids was performed. A set of 271 recombinant inbred lines (RIL) were constructed with Y3 and K326 as the parents. The RIL population was planted in Yanhe, Yuxi, Yunnan province and Shilin, Kunming, Yunnan province in 2018, 2019, and 2020, respectively. Five alkaloid phenotypes including total plant alkali (TPA), nicotine (NIC), nornicotine (NOR), anabasine (ANAB), and anatabine (ANAT) were measured. A linkage map of 46,129 markers was constructed by genome sequencing of the population. QTL mapping was performed by the software QTLNetwork 2.0 which was developed based on the mixed linear model. A total of 15 QTLs with significant additive effects were identified. The contribution rate of additive effect to the corresponding phenotypes varied from 0.58% to 11.57%. Four major QTLs, qTPA14 for total plant alkali, qNIC14 for nicotine, qANAB14 for anabasine, and qANAT14 for anatabine, accounted for more than 10% of phenotypic variation of the corresponding traits, which were located in linkage group 14. Six QTLs with significant additive-by-environment interaction effects were detected, their additive-by-environment interaction effects explained the phenotypic variation of 0.80%--1.81%. Five pairs of QTLs with significant additive-by-additive epistasis effects were detected, accounting for phenotypic variation from 0.15% to 2.31%, while two pairs of QTLs were detected with significant epistasis-by-environment interaction effects, which explaining the proportion of phenotypic variation from 0.81% to 1.16%. The results pave a foundation for further isolation of candidate genes, the dissection of genetic mechanism, and the molecular improvement of tobacco alkaloid traits. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Genetic effects on molecular network states explain complex traits
- Author
-
Matthias Weith, Jan Großbach, Mathieu Clement‐Ziza, Ludovic Gillet, María Rodríguez‐López, Samuel Marguerat, Christopher T Workman, Paola Picotti, Jürg Bähler, Ruedi Aebersold, and Andreas Beyer
- Subjects
complex traits ,network effects ,PKA signaling ,QTL mapping ,TOR signaling ,Biology (General) ,QH301-705.5 ,Medicine (General) ,R5-920 - Abstract
Abstract The complexity of many cellular and organismal traits results from the integration of genetic and environmental factors via molecular networks. Network structure and effect propagation are best understood at the level of functional modules, but so far, no concept has been established to include the global network state. Here, we show when and how genetic perturbations lead to molecular changes that are confined to small parts of a network versus when they lead to modulation of network states. Integrating multi‐omics profiling of genetically heterogeneous budding and fission yeast strains with an array of cellular traits identified a central state transition of the yeast molecular network that is related to PKA and TOR (PT) signaling. Genetic variants affecting this PT state globally shifted the molecular network along a single‐dimensional axis, thereby modulating processes including energy and amino acid metabolism, transcription, translation, cell cycle control, and cellular stress response. We propose that genetic effects can propagate through large parts of molecular networks because of the functional requirement to centrally coordinate the activity of fundamental cellular processes.
- Published
- 2023
- Full Text
- View/download PDF
38. Quantifying the contribution of dominance deviation effects to complex trait variation in biobank-scale data
- Author
-
Pazokitoroudi, Ali, Chiu, Alec M, Burch, Kathryn S, Pasaniuc, Bogdan, and Sankararaman, Sriram
- Subjects
Epidemiology ,Biological Sciences ,Health Sciences ,Genetics ,Human Genome ,Biological Specimen Banks ,Datasets as Topic ,Female ,Genes ,Dominant ,Genetic Variation ,Humans ,Male ,Models ,Genetic ,Multifactorial Inheritance ,Polymorphism ,Single Nucleotide ,additive ,biobank ,complex traits ,dominance ,genetic variation ,heritability ,mixed models ,variance components ,Medical and Health Sciences ,Genetics & Heredity ,Biological sciences ,Biomedical and clinical sciences ,Health sciences - Abstract
The proportion of variation in complex traits that can be attributed to non-additive genetic effects has been a topic of intense debate. The availability of biobank-scale datasets of genotype and trait data from unrelated individuals opens up the possibility of obtaining precise estimates of the contribution of non-additive genetic effects. We present an efficient method to estimate the variation in a complex trait that can be attributed to additive (additive heritability) and dominance deviation (dominance heritability) effects across all genotyped SNPs in a large collection of unrelated individuals. Over a wide range of genetic architectures, our method yields unbiased estimates of additive and dominance heritability. We applied our method, in turn, to array genotypes as well as imputed genotypes (at common SNPs with minor allele frequency [MAF] > 1%) and 50 quantitative traits measured in 291,273 unrelated white British individuals in the UK Biobank. Averaged across these 50 traits, we find that additive heritability on array SNPs is 21.86% while dominance heritability is 0.13% (about 0.48% of the additive heritability) with qualitatively similar results for imputed genotypes. We find no statistically significant evidence for dominance heritability (p
- Published
- 2021
39. The impact of identity by descent on fitness and disease in dogs
- Author
-
Mooney, Jazlyn A, Yohannes, Abigail, and Lohmueller, Kirk E
- Subjects
Agricultural ,Veterinary and Food Sciences ,Biological Sciences ,Evolutionary Biology ,Genetics ,Animals ,Dogs ,Genetic Fitness ,Genetic Variation ,Genome ,Genotype ,Health ,Homozygote ,Inbreeding ,Inbreeding Depression ,Inheritance Patterns ,Multifactorial Inheritance ,Polymorphism ,Single Nucleotide ,Selective Breeding ,inbreeding depression ,fitness ,deleterious mutations ,complex traits - Abstract
Domestic dogs have experienced population bottlenecks, recent inbreeding, and strong artificial selection. These processes have simplified the genetic architecture of complex traits, allowed deleterious variation to persist, and increased both identity-by-descent (IBD) segments and runs of homozygosity (ROH). As such, dogs provide an excellent model for examining how these evolutionary processes influence disease. We assembled a dataset containing 4,414 breed dogs, 327 village dogs, and 380 wolves genotyped at 117,288 markers and data for clinical and morphological phenotypes. Breed dogs have an enrichment of IBD and ROH, relative to both village dogs and wolves, and we use these patterns to show that breed dogs have experienced differing severities of bottlenecks in their recent past. We then found that ROH burden is associated with phenotypes in breed dogs, such as lymphoma. We next test the prediction that breeds with greater ROH have more disease alleles reported in the Online Mendelian Inheritance in Animals (OMIA). Surprisingly, the number of causal variants identified correlates with the popularity of that breed rather than the ROH or IBD burden, suggesting an ascertainment bias in OMIA. Lastly, we use the distribution of ROH across the genome to identify genes with depletions of ROH as potential hotspots for inbreeding depression and find multiple exons where ROH are never observed. Our results suggest that inbreeding has played a large role in shaping genetic and phenotypic variation in dogs and that future work on understudied breeds may reveal new disease-causing variation.
- Published
- 2021
40. Negative selection on complex traits limits phenotype prediction accuracy between populations
- Author
-
Durvasula, Arun and Lohmueller, Kirk E
- Subjects
Epidemiology ,Biological Sciences ,Health Sciences ,Genetics ,Human Genome ,Aetiology ,2.5 Research design and methodologies (aetiology) ,Generic health relevance ,Africa ,Computer Simulation ,Datasets as Topic ,Europe ,Genetic Predisposition to Disease ,Genetic Variation ,Genetics ,Population ,Humans ,Models ,Genetic ,Multifactorial Inheritance ,Phenotype ,Population Growth ,Selection ,Genetic ,United Kingdom ,complex traits ,negative selection ,polygenic scores ,population genetics ,population history ,risk prediction ,simulations ,Medical and Health Sciences ,Genetics & Heredity ,Biological sciences ,Biomedical and clinical sciences ,Health sciences - Abstract
Phenotype prediction is a key goal for medical genetics. Unfortunately, most genome-wide association studies are done in European populations, which reduces the accuracy of predictions via polygenic scores in non-European populations. Here, we use population genetic models to show that human demographic history and negative selection on complex traits can result in population-specific genetic architectures. For traits where alleles with the largest effect on the trait are under the strongest negative selection, approximately half of the heritability can be accounted for by variants in Europe that are absent from Africa, leading to poor performance in phenotype prediction across these populations. Further, under such a model, individuals in the tails of the genetic risk distribution may not be identified via polygenic scores generated in another population. We empirically test these predictions by building a model to stratify heritability between European-specific and shared variants and applied it to 37 traits and diseases in the UK Biobank. Across these phenotypes, ∼30% of the heritability comes from European-specific variants. We conclude that genetic association studies need to include more diverse populations to enable the utility of phenotype prediction in all populations.
- Published
- 2021
41. Effects of HIV Infection on Arterial Endothelial Function
- Author
-
Stein, James H, Kime, Noah, Korcarz, Claudia E, Ribaudo, Heather, Currier, Judith S, and Delaney, Joseph C
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Clinical Sciences ,Sexually Transmitted Infections ,Clinical Research ,HIV/AIDS ,Prevention ,Infectious Diseases ,Cardiovascular ,Infection ,Good Health and Well Being ,AIDS Serodiagnosis ,AIDS-Associated Nephropathy ,Adolescent ,Adult ,Aged ,Brachial Artery ,Cardiovascular Diseases ,Case-Control Studies ,Endothelium ,Vascular ,Female ,HIV Infections ,HIV Seronegativity ,HIV Seropositivity ,Heart Disease Risk Factors ,Humans ,Male ,Middle Aged ,Predictive Value of Tests ,Risk Assessment ,Severity of Illness Index ,Vasodilation ,Young Adult ,arteries ,cardiovascular diseases ,creatinine ,human immunodeficiency virus ,viremia ,complex traits ,Genome Wide Association Studies ,lincRNAs ,syntenic conservation ,Cardiorespiratory Medicine and Haematology ,Cardiovascular System & Hematology ,Cardiovascular medicine and haematology ,Clinical sciences - Abstract
ObjectiveTo determine the effects of HIV serostatus and disease severity on endothelial function in a large pooled cohort study of people living with HIV infection and HIV- controls. Approach and Results: We used participant-level data from 9 studies: 7 included people living with HIV (2 treatment-naïve) and 4 had HIV- controls. Brachial artery flow-mediated dilation (FMD) was measured using a standardized ultrasound imaging protocol with central reading. After data harmonization, multiple linear regression was used to examine the effects of HIV- serostatus, HIV disease severity measures, and cardiovascular disease risk factors on FMD. Of 2533 participants, 986 were people living with HIV (mean 44.4 [SD 11.8] years old) and 1547 were HIV- controls (42.9 [12.2] years old). The strongest and most consistent associates of FMD were brachial artery diameter, age, sex, and body mass index. The effect of HIV+ serostatus on FMD was strongly influenced by kidney function. In the highest tertile of creatinine (1.0 mg/dL), the effect of HIV+ serostatus was strong (β=-1.59% [95% CI, -2.58% to -0.60%], P=0.002), even after covariate adjustment (β=-1.36% [95% CI, -2.46% to -0.47%], P=0.003). In the lowest tertile (0.8 mg/dL), the effect of HIV+ serostatus was strong (β=-1.90% [95% CI, -2.58% to -1.21%], P
- Published
- 2021
42. Cost-Effective Mapping of Genetic Interactions in Mammalian Cells
- Author
-
Khan, Arshad H and Smith, Desmond J
- Subjects
Biological Sciences ,Genetics ,Biotechnology ,2.1 Biological and endogenous factors ,Aetiology ,cancer ,cell growth ,complex traits ,gene interactions ,GWAS ,radiation hybrid ,genetic variants ,copy number variants ,Clinical Sciences ,Law - Abstract
Comprehensive maps of genetic interactions in mammalian cells are daunting to construct because of the large number of potential interactions, ~ 2 × 108 for protein coding genes. We previously used co-inheritance of distant genes from published radiation hybrid (RH) datasets to identify genetic interactions. However, it was necessary to combine six legacy datasets from four species to obtain adequate statistical power. Mapping resolution was also limited by the low density PCR genotyping. Here, we employ shallow sequencing of nascent human RH clones as an economical approach to constructing interaction maps. In this initial study, 15 clones were analyzed, enabling construction of a network with 225 genes and 2,359 interactions (FDR < 0.05). Despite its small size, the network showed significant overlap with the previous RH network and with a protein-protein interaction network. Consumables were ≲$50 per clone, showing that affordable, high quality genetic interaction maps are feasible in mammalian cells.
- Published
- 2021
43. Testing associations between human anxiety and genes previously implicated by mouse anxiety models.
- Author
-
Brasher, Maizy S., Mize, Travis J., Thomas, Aimee L., Hoeffer, Charles A., Ehringer, Marissa A., and Evans, Luke M.
- Subjects
- *
HUMAN genes , *HUMAN genetic variation , *LABORATORY mice , *ANIMAL disease models , *GENE expression , *PHENOTYPES - Abstract
Anxiety disorders are common and can be debilitating, with effective treatments remaining hampered by an incomplete understanding of the underlying genetic etiology. Improvements have been made in understanding the genetic influences on mouse behavioral models of anxiety, yet it is unclear the extent to which genes identified in these experimental systems contribute to genetic variation in human anxiety phenotypes. Leveraging new and existing large‐scale human genome‐wide association studies, we tested whether sets of genes previously identified in mouse anxiety‐like behavior studies contribute to a range of human anxiety disorders. When tested as individual genes, 13 mouse‐identified genes were associated with human anxiety phenotypes, suggesting an overlap of individual genes contributing to both mouse models of anxiety‐like behaviors and human anxiety traits. When genes were tested as sets, we did identify 14 significant associations between mouse gene sets and human anxiety, but the majority of gene sets showed no significant association with human anxiety phenotypes. These few significant associations indicate a need to identify and develop more translatable mouse models by identifying sets of genes that "match" between model systems and specific human phenotypes of interest. We suggest that continuing to develop improved behavioral paradigms and finer‐scale experimental data, for instance from individual neuronal subtypes or cell‐type‐specific expression data, is likely to improve our understanding of the genetic etiology and underlying functional changes in anxiety disorders. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. Benchmarking of local genetic correlation estimation methods using summary statistics from genome-wide association studies.
- Author
-
Zhang, Chi, Zhang, Yiliang, Zhang, Yunxuan, and Zhao, Hongyu
- Subjects
- *
GENOME-wide association studies , *GENETIC correlations , *GENETIC variation , *LINKAGE disequilibrium - Abstract
Local genetic correlation evaluates the correlation of additive genetic effects between different traits across the same genetic variants at a genomic locus. It has been proven informative for understanding the genetic similarities of complex traits beyond that captured by global genetic correlation calculated across the whole genome. Several summary-statistics-based approaches have been developed for estimating local genetic correlation, including |$\rho$| -hess, SUPERGNOVA and LAVA. However, there has not been a comprehensive evaluation of these methods to offer practical guidelines on the choices of these methods. In this study, we conduct benchmark comparisons of the performance of these three methods through extensive simulation and real data analyses. We focus on two technical difficulties in estimating local genetic correlation: sample overlaps across traits and local linkage disequilibrium (LD) estimates when only the external reference panels are available. Our simulations suggest the likelihood of incorrectly identifying correlated regions and local correlation estimation accuracy are highly dependent on the estimation of the local LD matrix. These observations are corroborated by real data analyses of 31 complex traits. Overall, our findings illuminate the distinct results yielded by different methods applied in post-genome-wide association studies (post-GWAS) local correlation studies. We underscore the sensitivity of local genetic correlation estimates and inferences to the precision of local LD estimation. These observations accentuate the vital need for ongoing refinement in methodologies. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
45. Genetic Basis of Aerobically Supported Voluntary Exercise: Results from a Selection Experiment with House Mice.
- Author
-
Hillis, David, Yadgary, Liran, Weinstock, George, Pardo-Manuel de Villena, Fernando, Pomp, Daniel, Fowler, Alexandra, Xu, Shizhong, Chan, Frank, and Garland, Theodore
- Subjects
artificial selection ,behavior ,complex traits ,experimental evolution ,population differentiation ,Animals ,Cadherins ,Chromosomes ,Directed Molecular Evolution ,Eye Proteins ,Female ,Hybridization ,Genetic ,Male ,Membrane Transport Proteins ,Mice ,Mice ,Inbred ICR ,Multifactorial Inheritance ,Polymorphism ,Single Nucleotide ,Receptors ,LDL ,Running ,Selection ,Genetic - Abstract
The biological basis of exercise behavior is increasingly relevant for maintaining healthy lifestyles. Various quantitative genetic studies and selection experiments have conclusively demonstrated substantial heritability for exercise behavior in both humans and laboratory rodents. In the High Runner selection experiment, four replicate lines of Mus domesticus were bred for high voluntary wheel running (HR), along with four nonselected control (C) lines. After 61 generations, the genomes of 79 mice (9-10 from each line) were fully sequenced and single nucleotide polymorphisms (SNPs) were identified. We used nested ANOVA with MIVQUE estimation and other approaches to compare allele frequencies between the HR and C lines for both SNPs and haplotypes. Approximately 61 genomic regions, across all somatic chromosomes, showed evidence of differentiation; 12 of these regions were differentiated by all methods of analysis. Gene function was inferred largely using Panther gene ontology terms and KO phenotypes associated with genes of interest. Some of the differentiated genes are known to be associated with behavior/motivational systems and/or athletic ability, including Sorl1, Dach1, and Cdh10 Sorl1 is a sorting protein associated with cholinergic neuron morphology, vascular wound healing, and metabolism. Dach1 is associated with limb bud development and neural differentiation. Cdh10 is a calcium ion binding protein associated with phrenic neurons. Overall, these results indicate that selective breeding for high voluntary exercise has resulted in changes in allele frequencies for multiple genes associated with both motivation and ability for endurance exercise, providing candidate genes that may explain phenotypic changes observed in previous studies.
- Published
- 2020
46. Genomic Prediction Informed by Biological Processes Expands Our Understanding of the Genetic Architecture Underlying Free Amino Acid Traits in Dry Arabidopsis Seeds.
- Author
-
Turner-Hissong, Sarah D, Bird, Kevin A, Lipka, Alexander E, King, Elizabeth G, Beissinger, Timothy M, and Angelovici, Ruthie
- Subjects
Humans ,Arabidopsis ,Seeds ,Amino Acids ,Genomics ,Biological Phenomena ,Plant Breeding ,GenPred ,MultiBLUP ,Shared Data Resources ,amino acids ,complex traits ,genomic prediction ,Biotechnology ,Genetics ,Human Genome - Abstract
Plant growth, development, and nutritional quality depends upon amino acid homeostasis, especially in seeds. However, our understanding of the underlying genetics influencing amino acid content and composition remains limited, with only a few candidate genes and quantitative trait loci identified to date. Improved knowledge of the genetics and biological processes that determine amino acid levels will enable researchers to use this information for plant breeding and biological discovery. Toward this goal, we used genomic prediction to identify biological processes that are associated with, and therefore potentially influence, free amino acid (FAA) composition in seeds of the model plant Arabidopsis thaliana Markers were split into categories based on metabolic pathway annotations and fit using a genomic partitioning model to evaluate the influence of each pathway on heritability explained, model fit, and predictive ability. Selected pathways included processes known to influence FAA composition, albeit to an unknown degree, and spanned four categories: amino acid, core, specialized, and protein metabolism. Using this approach, we identified associations for pathways containing known variants for FAA traits, in addition to finding new trait-pathway associations. Markers related to amino acid metabolism, which are directly involved in FAA regulation, improved predictive ability for branched chain amino acids and histidine. The use of genomic partitioning also revealed patterns across biochemical families, in which serine-derived FAAs were associated with protein related annotations and aromatic FAAs were associated with specialized metabolic pathways. Taken together, these findings provide evidence that genomic partitioning is a viable strategy to uncover the relative contributions of biological processes to FAA traits in seeds, offering a promising framework to guide hypothesis testing and narrow the search space for candidate genes.
- Published
- 2020
47. Convergent evolution of biochemical mechanisms in the bioluminescence systems of ostracods, toadfishes, and brittle stars
- Author
-
Lau, Emily
- Subjects
Evolution & development ,Biochemistry ,Molecular biology ,bioluminescence ,complex traits ,convergent evolution ,luciferase ,molecular evolution ,parallel evolution - Abstract
Convergent evolution, the phylogenetically independent evolution of similar traits, offers a valuable avenue for investigating how complex traits originate and the predictability of evolution. Bioluminescence, the biological production of light by a living organism, is an excellent system for addressing these questions. Bioluminescence convergently evolved over 94 times, is morphologically and functionally diverse, and is encoded by many genes that can be functionally tested in the laboratory. In this dissertation, I begin by proposing an integrative approach to studying the convergent evolution of complex traits, focusing particularly on bioluminescence. Then, I provide evidence for the genetic basis of various biochemical mechanisms underlying three different bioluminescence systems. First, I identify a gene that may be used to modulate the availability of the bioluminescent substrate in an ostracod crustacean. I synthesize this result with previous studies on fireflies and sea pansies to show that these three distantly related taxa independently recruited members of an ancient gene family to modulate their bioluminescent substrates. Second, I identify a gene encoding a structural protein in the lens of light-producing organs in toadfishes. This gene may have originated in fish genomes via an ancient horizontal gene transfer from bacteria. After being maintained in fish genomes for over 300 million years, this gene was recruited to produce the lens of light-producing organs in toadfishes. Finally, I identify the gene encoding a bioluminescent protein in brittle stars. I provide functional evidence supporting the repeated evolution of bioluminescent proteins from the haloalkane dehalogenase gene family, which may have originated in metazoans from a horizontal gene transfer from bacteria. Altogether, my work illustrates how evolutionary convergence may recruit homologous and non-homologous genes, depending on the convergent function or structure, and highlights how ancient horizontal gene transfers may have long-term evolutionary implications for evolving novel structures and functions.
- Published
- 2024
48. Quantitative genetics of complex traits : solutions for studying the genetic basis of variation in yeast
- Author
-
Hu, Yue
- Subjects
576.5 ,microbiology ,yeast ,complex traits ,Quantitative genetics - Abstract
Recent advances in high-throughput techniques for DNA sequencing and phenotyping have greatly facilitated the identification of genetic variants underlying traits at a genomewide level. In this study, a large amount of yeast genetic resources and phenotypic data were collected for the study of natural genetic variation in yeast under different environment conditions. Quantitative trait locus (QTL) analysis and epistasis analysis have been applied to Saccharomyces cerevisiae on 6 groups of 1st generation bi-parental inter-cross segregants and 12th generation multiparental high resolution segregants. Using yeast as model organism, growth under stress conditions of a variety of conventional genotoxic agents was measured. Different QTLs were mapped to causative genes that are related to DNA repair and protein transport. In addition, by comparing the genes identified under 19 different agents, 14 frequently occurring genes producing effect on the growth of yeast, were further analysed. QTL output was clustered through a changepoint model for improving the selection of candidate genes in large gene sets. Furthermore, Temporal QTL analysis was applied to study the dynamic development of yeast growth under X-ray irradiation that expands the phenotype in the time dimension. By comparing the QTL in different time spans, genes that only exhibit effects for a certain period of time rather than continuously through, or at the end of, the experiment were found. One of the major industrial applications of yeast is brewing. In this project, whole genome sequencing analysis were performed on a highly diverse 12th generation de novo hybrid population. Variant calling was applied for these pool sequencing and identification of genetic variants. Pool QTL analysis was applied to compare the allele frequency difference of extreme pools under the same condition. Multiple QTL intervals responding to the brewing environment were identified. This provides useful genetic insights for brewing yeast breeding and improvement.
- Published
- 2020
- Full Text
- View/download PDF
49. Systems genetics approaches for understanding complex traits with relevance for human disease
- Author
-
Hooman Allayee, Charles R Farber, Marcus M Seldin, Evan Graehl Williams, David E James, and Aldons J Lusis
- Subjects
systems genetics ,complex traits ,omics ,mouse models ,human populations ,Medicine ,Science ,Biology (General) ,QH301-705.5 - Abstract
Quantitative traits are often complex because of the contribution of many loci, with further complexity added by environmental factors. In medical research, systems genetics is a powerful approach for the study of complex traits, as it integrates intermediate phenotypes, such as RNA, protein, and metabolite levels, to understand molecular and physiological phenotypes linking discrete DNA sequence variation to complex clinical and physiological traits. The primary purpose of this review is to describe some of the resources and tools of systems genetics in humans and rodent models, so that researchers in many areas of biology and medicine can make use of the data.
- Published
- 2023
- Full Text
- View/download PDF
50. Genome-wide screen identifies host loci that modulate Mycobacterium tuberculosis fitness in immunodivergent mice.
- Author
-
Meade, Rachel K., Long, Jarukit E., Jinich, Adrian, Rhee, Kyu Y., Ashbrook, David G., Williams, Robert W., Sassetti, Christopher M., and Smith, Clare M.
- Subjects
- *
MYCOBACTERIUM tuberculosis , *LOCUS (Genetics) , *SCIENTIFIC community , *MEDICAL screening , *BACTERIAL genetics , *BACTERIAL genes - Abstract
Genetic differences among mammalian hosts and among strains of Mycobacterium tuberculosis (Mtb) are well-established determinants of tuberculosis (TB) patient outcomes. The advent of recombinant inbred mouse panels and next-generation transposon mutagenesis and sequencing approaches has enabled dissection of complex host-pathogen interactions. To identify host and pathogen genetic determinants of Mtb pathogenesis, we infected members of the highly diverse BXD family of strains with a comprehensive library of Mtb transposon mutants (TnSeq). Members of the BXD family segregate for Mtb-resistant C57BL/6J (B6 or B) and Mtb-susceptible DBA/2J (D2 or D) haplotypes. The survival of each bacterial mutant was quantified within each BXD host, and we identified those bacterial genes that were differentially required for Mtb fitness across BXD genotypes. Mutants that varied in survival among the host family of strains were leveraged as reporters of "endophenotypes," each bacterial fitness profile directly probing specific components of the infection microenvironment. We conducted quantitative trait loci (QTL) mapping of these bacterial fitness endophenotypes and identified 140 host-pathogen QTL (hpQTL). We located a QTL hotspot on chromosome 6 (75.97-88.58 Mb) associated with the genetic requirement of multiple Mtb genes: Rv0127 (mak), Rv0359 (rip2), Rv0955 (perM), and Rv3849 (espR). Together, this screen reinforces the utility of bacterial mutant libraries as precise reporters of the host immunological microenvironment during infection and highlights specific host-pathogen genetic interactions for further investigation. To enable downstream follow-up for both bacterial and mammalian genetic research communities, all bacterial fitness profiles have been deposited into GeneNetwork.org and added into the comprehensive collection of TnSeq libraries in MtbTnDB. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.