Back to Search Start Over

Type 2 diabetes genetic loci informed by multi-trait associations point to disease mechanisms and subtypes: A soft clustering analysis

Authors :
Kyle J. Gaulton
Joshua Chiou
Jose C. Florez
Joanne B. Cole
Gad Getz
Miriam S. Udler
Jaegil Kim
Marcin von Grotthuss
Michael Boehnke
Markku Laakso
Benjamin Glaser
Gil Atzmon
Josep M. Mercader
Jason Flannick
Sílvia Bonàs-Guarch
Source :
PLoS Medicine, PLoS Medicine, Vol 15, Iss 9, p e1002654 (2018)
Publication Year :
2018
Publisher :
Public Library of Science (PLoS), 2018.

Abstract

Background Type 2 diabetes (T2D) is a heterogeneous disease for which (1) disease-causing pathways are incompletely understood and (2) subclassification may improve patient management. Unlike other biomarkers, germline genetic markers do not change with disease progression or treatment. In this paper, we test whether a germline genetic approach informed by physiology can be used to deconstruct T2D heterogeneity. First, we aimed to categorize genetic loci into groups representing likely disease mechanistic pathways. Second, we asked whether the novel clusters of genetic loci we identified have any broad clinical consequence, as assessed in four separate subsets of individuals with T2D. Methods and findings In an effort to identify mechanistic pathways driven by established T2D genetic loci, we applied Bayesian nonnegative matrix factorization (bNMF) clustering to genome-wide association study (GWAS) results for 94 independent T2D genetic variants and 47 diabetes-related traits. We identified five robust clusters of T2D loci and traits, each with distinct tissue-specific enhancer enrichment based on analysis of epigenomic data from 28 cell types. Two clusters contained variant-trait associations indicative of reduced beta cell function, differing from each other by high versus low proinsulin levels. The three other clusters displayed features of insulin resistance: obesity mediated (high body mass index [BMI] and waist circumference [WC]), “lipodystrophy-like” fat distribution (low BMI, adiponectin, and high-density lipoprotein [HDL] cholesterol, and high triglycerides), and disrupted liver lipid metabolism (low triglycerides). Increased cluster genetic risk scores were associated with distinct clinical outcomes, including increased blood pressure, coronary artery disease (CAD), and stroke. We evaluated the potential for clinical impact of these clusters in four studies containing individuals with T2D (Metabolic Syndrome in Men Study [METSIM], N = 487; Ashkenazi, N = 509; Partners Biobank, N = 2,065; UK Biobank [UKBB], N = 14,813). Individuals with T2D in the top genetic risk score decile for each cluster reproducibly exhibited the predicted cluster-associated phenotypes, with approximately 30% of all individuals assigned to just one cluster top decile. Limitations of this study include that the genetic variants used in the cluster analysis were restricted to those associated with T2D in populations of European ancestry. Conclusion Our approach identifies salient T2D genetically anchored and physiologically informed pathways, and supports the use of genetics to deconstruct T2D heterogeneity. Classification of patients by these genetic pathways may offer a step toward genetically informed T2D patient management.<br />Using a clustering Bayesian approach applied to GWAS, Jose Florez and colleagues identify traits and loci associated with type 2 diabetes that may be used to classify patients.<br />Author summary Why was this study done? Clinical experience and physiological phenotyping suggest that type 2 diabetes (T2D) is a heterogeneous disease. Recent studies leveraged phenotypic data to identify subtypes of individuals with T2D but have not included genetic data as a driving component of the clustering process. Previous efforts to cluster T2D genetic loci used unsupervised hierarchical clustering, a “hard clustering” method that restricts the membership of a given genetic locus to a single cluster. What did the researchers do and find? We applied a novel “soft clustering” method termed Bayesian nonnegative matrix factorization to cluster variant-trait associations ascertained from publicly available genome-wide association studies for 94 known T2D variants and 47 diabetes-related traits. We identified five novel robust clusters of T2D loci, two related to insulin deficiency and three related to insulin resistance, and showed that these clusters are differentially enriched for relevant tissue-specific enhancers and promoters. In up to 17,365 individuals with T2D from four distinct studies, about 30% of individuals have a genetic risk score in the top decile of uniquely one cluster, and these individuals have cluster-specific phenotypic traits that distinguish them from others with T2D. What do these findings mean? Soft clustering of genetic loci associated with T2D produced biologically supported mechanistic pathways, illuminating how loci might impact T2D risk. Classification of individuals by these genetic pathways may offer a step toward genetically anchored but physiologically informed diagnosis, surveillance, and management of patients with T2D.

Details

ISSN :
15491676
Volume :
15
Database :
OpenAIRE
Journal :
PLOS Medicine
Accession number :
edsair.doi.dedup.....cfc4123a940455a402b563b2e5c02429