Background: In Canada, one in seven adults has diabetes (i.e., 2.3 million) and the lifetime risk of developing diabetes is approximately 30% by age 65. Although 30% of patients admitted to the hospital have diabetes, data from inpatient hospitalizations for patients with diabetes are lacking, both in Canada and globally., Objective: To validate International Classification of Diseases 10th edition Canadian version (ICD-10-CA) codes for the identification of patients with diabetes, to create a multicenter database of patients with diabetes hospitalized under internal medicine in Ontario, and to determine their baseline characteristics, medication use, and admission characteristics., Study Design: We created a database of people who had diabetes and were hospitalized between 2010 and 2020 at 8 hospitals in Ontario that were part of the General Medicine Inpatient Initiative (GEMINI) hospital data-sharing network. Patients who had diabetes were identified using chart review, based upon either (i) a previous physician diagnosis of diabetes, (ii) a recorded hemoglobin A1c ≥ 6.5% or (iii) outpatient prescription of a diabetes medication preceding the hospitalization. The test characteristics of ICD-10-CA codes for diabetes were evaluated. We compared baseline demographics, medication use and hospitalization details among patients with and without diabetes. For hospitalization details, we collected information on the admission diagnosis, comorbidity index, length of stay, receipt of ICU-level care, and inpatient mortality., Results: There were 384,588 admissions within the total study cohort, of which 118,987 (30.9%) had an ICD-10-CA diagnosis code of diabetes (E10.x, E11.x, E13.x, E14.x). The sensitivity and specificity of ICD-10-CA diagnostic codes was 95.9% and 98.8%, respectively. Most patients with an ICD-10-CA code for diabetes had a code for type 2 diabetes (93.9%) and a code for type 1 diabetes was rare (6.1%). The mean age was 66.4 years for patients without diabetes and 71.3 years for those with an ICD-10-CA diagnosis code for diabetes. Patients with diabetes had a higher prevalence of hypertension (64% vs. 37.9%), coronary artery disease (28.7% vs. 15.3%), heart failure (24.5% vs. 12.1%) and renal failure (33.8% vs. 17.3%) in comparison to those without diabetes. The most prevalent diabetes medications received in hospital were metformin (43%), DPP4 inhibitors (22.7%) and sulfonylureas (18.8%). The most common reason for admission among patients with diabetes was heart failure (9.0%), and among patients without diabetes was pneumonia (7.8%). Median length of stay was longer for patients with diabetes (5.5 vs. 4.5 days) and in-hospital mortality was similar between groups (6.8% with diabetes vs. 6.5% without diabetes)., Importance: Diabetes is one of the most prevalent chronic medical conditions, affecting roughly one third of all patients hospitalized on an internal medicine ward and is associated with other comorbidities and longer hospital stays. ICD-10-CA codes were highly accurate in identifying patients with diabetes. The development of an inpatient cohort will allow for further study of in-hospital practices and outcomes among patients with diabetes., Competing Interests: MF was a consultant for ProofDx, a start-up company creating a point of care diagnostic test for COVID-19. MF is an advisor for Signal1, a start-up company deploying machine learned models to improve inpatient care. TBS was a steering committee member of the Amgen-financed GALACTIC-HF trial, of the Boehringer Ingelheim financed SHARP3 trial, of “LUX-Dx TRENDS Evaluates Diagnostics Sensors in Heart Failure Patients Receiving Boston Scientific's Investigational ICM System” trial; advisory board: Amgen, CSL Seqirus and GSK; chief investigator of the Sanofi Pasteur financed “NUDGE-FLU”, “DANFLU-1”, and “DANFLU-2” trials; speaker honorarium: Bayer, Novartis, Sanofi Pasteur, GE healthcare and GSK; consultant appointments: Novo Nordisk, IQVIA and Parexel; and research grants: GE Healthcare, AstraZeneca, Novo Nordisk and Sanofi Pasteur. Outside of the submitted work, AV is a part-time employee of Ontario Health and a co-inventor of an AI-based early warning system for patient deterioration that was acquired by the start-up company Signal1. FR is a part-time employee of Ontario Health. The development of the GEMINI data platform has been supported with funding from sources such as the Canadian Cancer Society, the Canadian Frailty Network, the Canadian Institutes of Health Research, the Canadian Medical Protective Agency, Green Shield Canada Foundation, the Natural Sciences and Engineering Research Council of Canada, Ontario Health, the St. Michael’s Hospital Association Innovation Fund, the University of Toronto Department of Medicine, and in-kind support from partner hospitals and the Vector Institute. FR holds a salary award from Canada Research Chairs Program, University of Toronto and the PSI Graham Farquharson Knowledge Translation Fellowship. No funding sources from FR influenced the writing of this manuscript. There are no patents, products in development or marketed products to declare. This does not alter our adherence to PLOS ONE policies on sharing data and materials. All other authors have no reported conflicts of interest., (Copyright: © 2024 Colacci et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)