Author: "Sengwee, Toh" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Sengwee, Toh"' showing total 263 results


1. Analyzing missingness patterns in real-world data using the SMDI toolkit: application to a linked EHR-claims pharmacoepidemiology study

Author: Sudha R. Raman, Bradley G. Hammill, Pamela A. Shaw, Hana Lee, Sengwee Toh, John G. Connolly, Kimberly J. Dandreo, Vinit Nalawade, Fang Tian, Wei Liu, Jie Li, José J. Hernández-Muñoz, Robert J. Glynn, Rishi J. Desai, and Janick Weberpals
Subjects: Missing data, Electronic health records, Real-world evidence, Medicine (General), R5-920
Abstract: Abstract Background Missing data in confounding variables present a frequent challenge in generating evidence using real-world data, including electronic health records (EHR). Our objective was to apply a recently published toolkit for characterizing missing data patterns and, based on the toolkit results about likely missingness mechanisms, to illustrate the decision-making process for analyses in an empirical case example. Methods We utilized the Structural Missing Data Investigations (SMDI) toolkit to characterize missing data patterns in the context of a pharmacoepidemiology study comparing cardiovascular outcomes of initiating sodium-glucose-cotransporter-2 inhibitors (SGLT2i) and dipeptidyl peptidase‐4 inhibitors (DPP‐4i) among older adults. The study used a linked EHR-Medicare claims dataset from Duke Health patients (2015–2017), focusing on partially observed confounders from EHR data (HbA1c lab and body mass index [BMI] values). Our analysis incorporated SMDI's descriptive functions and diagnostic tests to explore missingness patterns and determine missingness mitigation approaches. We used findings from these investigations to inform estimation of adjusted hazard ratios comparing the two classes of medications. Results High levels of missingness were noted for important confounding variables including HbA1c (63.6%) and BMI (16.5%). Diagnostic tests resulted in output that described: 1) the distributions of patient characteristics, exposure, and outcome between patients with or without an observed value of the partially observed covariate, 2) the ability to predict missingness based on observed covariates, and 3) whether the missingness of a partially observed covariate is differential with respect to the outcome. There was evidence that missingness could be sufficiently described using observed data, which allowed multiple imputation by chained equations using random forests to address missing confounder data in estimating treatment effects.
Multiple imputation resulted in improved alignment of effect estimates with previous studies. Conclusions We were able to demonstrate the practical application of the SMDI toolkit in a real-world setting. Application of the SMDI toolkit and the resulting insights of potential missingness patterns can inform the choice of appropriate analytic methods and increase transparency of research methods in handling missing data. This type of approach can inform analytic decision making and may increase our ability to generate evidence from real-world data.
Published: 2024

2. Establishing a framework for privacy-preserving record linkage among electronic health record and administrative claims databases within PCORnet®, the National Patient-Centered Clinical Research Network

Author: Daniel Kiernan, Thomas Carton, Sengwee Toh, Jasmin Phua, Maryan Zirkle, Darcy Louzao, Kevin Haynes, Mark Weiner, Francisco Angulo, Charles Bailey, Jiang Bian, Daniel Fort, Shaun Grannis, Ashok Kumar Krishnamurthy, Vinit Nair, Pedro Rivera, Jonathan Silverstein, and Keith Marsolo
Subjects: Medical record linkage, Multicenter studies, Patient data privacy, Medicine, Biology (General), QH301-705.5, Science (General), Q1-390
Abstract: Abstract Objective The aim of this study was to determine whether a secure, privacy-preserving record linkage (PPRL) methodology can be implemented in a scalable manner for use in a large national clinical research network. Results We established the governance and technical capacity to support the use of PPRL across the National Patient-Centered Clinical Research Network (PCORnet®). As a pilot, four sites used the Datavant software to transform patient personally identifiable information (PII) into de-identified tokens. We queried the sites for patients with a clinical encounter in 2018 or 2019 and matched their tokens to determine whether overlap existed. We described patient overlap among the sites and generated a “deduplicated” table of patient demographic characteristics. Overlapping patients were found in 3 of the 6 site-pairs. Following deduplication, the total patient count was 3,108,515 (0.11% reduction), with the largest reduction in count for patients with an “Other/Missing” value for Sex, from 198 to 163 (17.6% reduction). The PPRL solution successfully links patients across data sources using distributed queries without directly accessing patient PII. The overlap queries and analysis performed in this pilot are being replicated across the full network to provide additional insight into patient linkages among a distributed research network.
Published: 2022

3. Broadening the reach of the FDA Sentinel system: A roadmap for integrating electronic health record data in a causal analysis framework

Author: Rishi J. Desai, Michael E. Matheny, Kevin Johnson, Keith Marsolo, Lesley H. Curtis, Jennifer C. Nelson, Patrick J. Heagerty, Judith Maro, Jeffery Brown, Sengwee Toh, Michael Nguyen, Robert Ball, Gerald Dal Pan, Shirley V. Wang, Joshua J. Gagne, and Sebastian Schneeweiss
Subjects: Computer applications to medicine. Medical informatics, R858-859.7
Abstract: Abstract The Sentinel System is a major component of the United States Food and Drug Administration’s (FDA) approach to active medical product safety surveillance. While Sentinel has historically relied on large quantities of health insurance claims data, leveraging longitudinal electronic health records (EHRs) that contain more detailed clinical information, as structured and unstructured features, may address some of the current gaps in capabilities. We identify key challenges when using EHR data to investigate medical product safety in a scalable and accelerated way, outline potential solutions, and describe the Sentinel Innovation Center’s initiatives to put solutions into practice by expanding and strengthening the existing system with a query-ready, large-scale data infrastructure of linked EHR and claims data. We describe our initiatives in four strategic priority areas: (1) data infrastructure, (2) feature engineering, (3) causal inference, and (4) detection analytics, with the goal of incorporating emerging data science innovations to maximize the utility of EHR data for medical product safety surveillance.
Published: 2021

4. Using the IMEDS distributed database for epidemiological studies in type 2 diabetes mellitus

Author: Young Hee Nam, Tongtong Wang, Robert T Greenlee, James Marshall, Aaron B Mendelsohn, Aziza Jamal-Allial, Sengwee Toh, Ting-Ying Huang, Carla Rodriguez-Watson, Shawna R Calhoun, Jillian Burk, Mano Selvan, Pamala A Pawloski, Cheryl N McMahill Walraven, Ashish Rai, and Jeffery S Brown
Subjects: Diseases of the endocrine glands. Clinical endocrinology, RC648-665
Abstract: Introduction This study aimed to assess data relevancy and data quality of the Innovation in Medical Evidence Development and Surveillance System Distributed Database (IMEDS-DD) for diabetes research and to evaluate comparability of its type 2 diabetes cohort to the general type 2 diabetes population. Research design and methods A retrospective study was conducted using the IMEDS-DD. Eligible members were adults with a medical encounter between April 1, 2018 and March 31, 2019 (index period). Type 2 diabetes and co-existing conditions were determined using all data available from April 1, 2016 to the most recent encounter within the index period. Type 2 diabetes patient characteristics, comorbidities and hemoglobin A1c (HbA1c) values were summarized and compared with those reported in national benchmarks and literature. Results Type 2 diabetes prevalence was 12.6% in the IMEDS-DD. Of 414,672 patients with type 2 diabetes, 52.8% were male, and the mean age was 65.0 (SD 13.3) years. Common comorbidities included hypertension (84.5%), hyperlipidemia (82.8%), obesity (45.3%), and cardiovascular disease (44.7%). Moderate-to-severe chronic kidney disease was observed in 20.2% of patients. The most commonly used antihyperglycemic agents included metformin (35.7%), sulfonylureas (14.8%), and insulin (9.9%). Less than one-half (48.9%) had an HbA1c value recorded. These findings demonstrated the notable similarity in patient characteristics between type 2 diabetes populations identified within the IMEDS-DD and other large databases. Conclusions Despite the limitations related to HbA1c data, our findings indicate that the IMEDS-DD contains robust information on key data elements to conduct pharmacoepidemiological studies in diabetes, including member demographic and clinical characteristics and health services utilization.
Published: 2022

5. Continuity and Completeness of Electronic Health Record Data for Patients Treated With Oral Hypoglycemic Agents: Findings From Healthcare Delivery Systems in Taiwan

Author: Chien-Ning Hsu, Kelly Huang, Fang-Ju Lin, Huang-Tz Ou, Ling-Ya Huang, Hsiao-Ching Kuo, Chi-Chuan Wang, and Sengwee Toh
Subjects: electronic health records, healthcare system, data continuity, data completeness, data quality, Taiwan, Therapeutics. Pharmacology, RM1-950
Abstract: Objective: To evaluate the continuity and completeness of electronic health record (EHR) data, and the concordance of select clinical outcomes and baseline comorbidities between EHR and linked claims data, from three healthcare delivery systems in Taiwan. Methods: We identified oral hypoglycemic agent (OHA) users from the Integrated Medical Database of National Taiwan University Hospital (NTUH-iMD), which was linked to the National Health Insurance Research Database (NHIRD), from June 2011 to December 2016. A secondary evaluation involved two additional EHR databases. We created consecutive 90-day periods before and after the first recorded OHA prescription and defined patients as having continuous EHR data if there was at least one encounter or prescription in a 90-day interval. EHR data completeness was measured by dividing the number of encounters in the NTUH-iMD by the number of encounters in the NHIRD. We assessed the concordance between EHR and claims data on three clinical outcomes (cardiovascular events, nephropathy-related events, and heart failure admission). We used individual comorbidities that comprised the Charlson comorbidity index to examine the concordance of select baseline comorbidities between EHRs and claims. Results: We identified 39,268 OHA users in the NTUH-iMD. Thirty-one percent (n = 12,296) of these users contributed to the analysis that examined data continuity during the 6-month baseline and 24-month follow-up period; 31% (n = 3,845) of the 12,296 users had continuous data during this 30-month period and EHR data completeness was 52%. The concordance of major cardiovascular events, nephropathy-related events, and heart failure admission was moderate, with the NTUH-iMD capturing 49–55% of the outcome events recorded in the NHIRD. The concordance of comorbidities was considerably different between the NTUH-iMD and NHIRD, with an absolute standardized difference >0.1 for most comorbidities examined.
Across the three EHR databases studied, 29–55% of the OHA users had continuous records during the 6-month baseline and 24-month follow-up period. Conclusion: EHR data continuity and data completeness may be suboptimal. A thorough evaluation of data continuity and completeness is recommended before conducting clinical and translational research using EHR data in Taiwan.
Published: 2022
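
Illustrative note: the continuity and completeness measures described in the abstract above can be sketched in a few lines of Python. This is not the authors' code. The function names, the half-open window boundaries, and the choice of two 90-day windows for the 6-month baseline and eight for the 24-month follow-up are assumptions made only for illustration.

```python
from datetime import date, timedelta


def has_continuous_data(index_date, event_dates, n_before=2, n_after=8, window_days=90):
    """Return True if every consecutive 90-day window contains at least one event.

    Assumption: windows are half-open [start, end) and are counted from the
    first recorded OHA prescription (index_date); the source abstract does not
    state the exact boundary rules.
    """
    for k in range(-n_before, n_after):
        start = index_date + timedelta(days=k * window_days)
        end = start + timedelta(days=window_days)
        # An "event" stands for any encounter or prescription date.
        if not any(start <= d < end for d in event_dates):
            return False
    return True


def completeness(ehr_encounters, claims_encounters):
    """EHR encounters divided by claims encounters for the same patients and period."""
    if claims_encounters <= 0:
        raise ValueError("claims_encounters must be positive")
    return ehr_encounters / claims_encounters
```

For example, a patient with one visit every 60 days across the whole 30-month period would be classified as continuous, while a patient with no contact during any single 90-day window would not.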

6. Suitability of administrative claims databases for bariatric surgery research – is the glass half-full or half-empty?

Author: Xiaojuan Li, Kristina H. Lewis, Katherine Callaway, J. Frank Wharam, and Sengwee Toh
Subjects: Bariatric surgery, Body mass index, Healthcare administrative claims, Predictive value of tests, Sensitivity and specificity, Validation study, Medicine (General), R5-920
Abstract: Abstract Background Claims databases are generally considered inadequate for obesity research due to suboptimal capture of body mass index (BMI) measurements. This might not be true for bariatric surgery because of reimbursement requirements and changes in coding systems. We assessed the availability and validity of claims-based weight-related diagnosis codes among bariatric surgery patients. Methods We identified three nested retrospective cohorts of adult bariatric surgery patients who underwent adjustable gastric banding, Roux-en-Y gastric bypass, or sleeve gastrectomy between January 1, 2011 and June 30, 2018 using different components of OptumLabs® Data Warehouse, which contains linked de-identified claims and electronic health records (EHRs). We measured the availability of claims-based weight-related diagnosis codes in the 6-month preoperative and 1-year postoperative periods in the main cohort identified in the claims data. We created two claims-based algorithms to classify the presence of severe obesity (a commonly used cohort selection criterion) and categorize BMI (a commonly used baseline confounder or postoperative outcome). We evaluated their performance by estimating sensitivity, specificity, positive predictive value, negative predictive value, and weighted kappa in two sub-cohorts using EHR-based BMI measurements as the reference. Results Among the 29,357 eligible patients identified using claims only, 28,828 (98.2%) had preoperative weight-related diagnosis codes, either granular indicating BMI ranges or nonspecific denoting obesity status. Among the 27,407 patients with granular preoperative codes, 12,346 (45.0%) had granular codes and 9355 (34.1%) had nonspecific codes in the 1-year postoperative period.
Among the 3045 patients with both preoperative claims-based diagnosis codes and EHR-based BMI measurements, the severe obesity classification algorithm had a sensitivity of 100%, specificity of 71%, positive predictive value of 100%, and negative predictive value of 78%. The BMI categorization algorithm had good validity categorizing the last available preoperative or postoperative BMI measurements (weighted kappa [95% confidence interval]: preoperative 0.78 [0.76, 0.79]; postoperative 0.84 [0.80, 0.87]). Conclusions Claims-based weight-related diagnosis codes had excellent validity before and after bariatric surgical operation but suboptimal availability after operation. Claims databases can be used for bariatric surgery studies of non-weight-related effectiveness and safety outcomes that are well-captured.
Published: 2020

7. Privacy-protecting estimation of adjusted risk ratios using modified Poisson regression in multi-center studies

Author: Di Shu, Jessica G. Young, and Sengwee Toh
Subjects: Distributed analysis, Modified Poisson regression, Multi-center studies, Odds ratio, Privacy protection, Risk ratio, Medicine (General), R5-920
Abstract: Abstract Background Multi-center studies can generate robust and generalizable evidence, but privacy considerations and legal restrictions often make it challenging or impossible to pool individual-level data across data-contributing sites. With binary outcomes, privacy-protecting distributed algorithms to conduct logistic regression analyses have been developed. However, the risk ratio often provides a more transparent interpretation of the exposure-outcome association than the odds ratio. Modified Poisson regression has been proposed to directly estimate adjusted risk ratios and produce confidence intervals with the correct nominal coverage when individual-level data are available. There are currently no distributed regression algorithms to estimate adjusted risk ratios while avoiding pooling of individual-level data in multi-center studies. Methods By leveraging the Newton-Raphson procedure, we adapted the modified Poisson regression method to estimate multivariable-adjusted risk ratios using only summary-level information in multi-center studies. We developed and tested the proposed method using both simulated and real-world data examples. We compared its results with the results from the corresponding pooled individual-level data analysis. Results Our proposed method produced the same adjusted risk ratio estimates and standard errors as the corresponding pooled individual-level data analysis without pooling individual-level data across data-contributing sites. Conclusions We developed and validated a distributed modified Poisson regression algorithm for valid and privacy-protecting estimation of adjusted risk ratios and confidence intervals in multi-center studies. This method allows computation of a more interpretable measure of association for binary outcomes, along with valid construction of confidence intervals, without sharing of individual-level data.
Published: 2019
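
Illustrative note: the idea in the abstract above, running Newton-Raphson for a log-link Poisson model while exchanging only site-level summaries, can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' published algorithm: the function names, the zero starting values, and the exact set of summaries shared (score vector, information matrix, and the "meat" of the robust sandwich variance) are choices made here for illustration.

```python
import numpy as np


def site_summaries(X, y, beta):
    """Summary-level quantities one site would share at the current beta."""
    mu = np.exp(X @ beta)                      # fitted risk under the log link
    resid = y - mu
    score = X.T @ resid                        # gradient of the Poisson log-likelihood
    info = X.T @ (X * mu[:, None])             # information matrix (negative Hessian)
    meat = X.T @ (X * (resid ** 2)[:, None])   # middle term of the sandwich variance
    return score, info, meat


def distributed_modified_poisson(sites, n_iter=50, tol=1e-10):
    """Fit a modified Poisson model from per-site summaries only.

    `sites` is a list of (X, y) pairs; in a real network each pair would stay
    at its own site and only the outputs of site_summaries would be sent.
    """
    p = sites[0][0].shape[1]
    beta = np.zeros(p)  # assumption: zeros as starting values
    for _ in range(n_iter):
        parts = [site_summaries(X, y, beta) for X, y in sites]
        score = sum(s for s, _, _ in parts)
        info = sum(i for _, i, _ in parts)
        step = np.linalg.solve(info, score)    # Newton-Raphson update
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    # Robust (sandwich) standard errors from summed site-level pieces.
    parts = [site_summaries(X, y, beta) for X, y in sites]
    info = sum(i for _, i, _ in parts)
    meat = sum(m for _, _, m in parts)
    bread = np.linalg.inv(info)
    cov = bread @ meat @ bread
    return beta, np.sqrt(np.diag(cov))
```

Because the score, information, and sandwich terms are sums over individuals, adding them across sites gives the same quantities as a pooled analysis, which is why the distributed and pooled estimates agree. Exponentiating a coefficient gives the corresponding adjusted risk ratio.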

8. Use of Antidiabetic drugs during pregnancy among U.S. women with Livebirth deliveries in the Mini-Sentinel system

Author: Katrina Mott, Marsha E. Reichman, Sengwee Toh, Caren Kieswetter, Katherine Haffenreffer, and Susan E. Andrade
Subjects: Pregnancy, Glyburide, Insulin, Gestational diabetes, Gynecology and obstetrics, RG1-991
Abstract: Abstract Background As the prevalence of diabetes mellitus increases in the population, the exposure to antidiabetic drugs (ADDs) during pregnancies is expected to grow, as has been seen over the last decade. The objective of this study was to estimate the prevalence of ADD use during pregnancy among women in the Mini-Sentinel Distributed Database (MSDD) who delivered a liveborn infant. Methods We identified qualifying livebirth pregnancies among women aged 10 to 54 years in the MSDD from 2001 to 2013. ADD use was estimated using outpatient pharmacy dispensing claims and days-supplied among three cohorts: all livebirth pregnancies, pregnancies among women with pre-existing diabetes, and pregnancies among women without prior ADD use. Results Among the 1.9 million pregnancies in the MSDD that resulted in a livebirth from 2001 to 2013, 4.4% were exposed to an ADD. Of the 15,606 pregnancies (0.8%) with pre-existing diabetes, 92.8% were also exposed during the pregnancy period. The most commonly used product in these pregnancies was insulin (75.6% of pregnancies). In contrast, in pregnancies of women without prior ADD use, the most commonly used products were glyburide and insulin, and most of these users were diagnosed with gestational diabetes. Conclusions Patterns of ADD use during pregnancy described here, along with changes in disease incidence and management, highlight the importance of continuing surveillance of ADD utilization patterns and examining the safety and effectiveness of these products in pregnancy.
Published: 2019

9. Association Between Regular Laxative Use and Incident Dementia in UK Biobank Participants

Author: Zhirong Yang, Chang Wei, Xiaojuan Li, Jinqiu Yuan, Xuefeng Gao, Bingyu Li, Ziyi Zhao, Sengwee Toh, Xin Yu, Carol Brayne, Zuyao Yang, Feng Sha, and Jinling Tang
Subjects: Neurology (clinical)
Abstract: Background and Objectives The use of over-the-counter laxatives is common in the general population. The microbiome-gut-brain axis hypothesis suggests that the use of laxatives could be associated with dementia. We aimed to examine the association between the regular use of laxatives and the incidence of dementia in UK Biobank participants. Methods This prospective cohort study was based on UK Biobank participants aged 40–69 years without a history of dementia. Regular use of laxatives was defined as self-reported use on most days of the week for the last 4 weeks at baseline (2006–2010). The outcomes were all-cause dementia, Alzheimer disease (AD), and vascular dementia (VD), identified from linked hospital admissions or death registers (up to 2019). Sociodemographic characteristics, lifestyle factors, medical conditions, family history, and regular medication use were adjusted for in the multivariable Cox regression analyses. Results Among the 502,229 participants with a mean age of 56.5 (SD 8.1) years at baseline, 273,251 (54.4%) were female, and 18,235 (3.6%) reported regular use of laxatives. Over a mean follow-up of 9.8 years, 218 (1.3%) participants with regular use of laxatives and 1,969 (0.4%) with no regular use developed all-cause dementia. Multivariable analyses showed that regular use of laxatives was associated with increased risk of all-cause dementia (hazard ratio [HR] 1.51; 95% CI 1.30–1.75) and VD (HR 1.65; 95% CI 1.21–2.27), with no significant association observed for AD (HR 1.05; 95% CI 0.79–1.40). The risk of both all-cause dementia and VD increased with the number of regularly used laxative types (p for trend 0.001 and 0.04, respectively). Among the participants who clearly reported that they were using just 1 type of laxative (n = 5,800), only those using osmotic laxatives showed a statistically significantly higher risk of all-cause dementia (HR 1.64; 95% CI 1.20–2.24) and VD (HR 1.97; 95% CI 1.04–3.75).
These results remained robust in various subgroup and sensitivity analyses. Discussion Regular use of laxatives was associated with a higher risk of all-cause dementia, particularly in those who used multiple laxative types or osmotic laxatives.
Published: 2023

10. Assessing medical product safety during pregnancy using parameterizable tools in the sentinel distributed database

Author: Jennifer G. Lyons, Elizabeth A. Suarez, Elnara Fazio‐Eynullayeva, Judith C. Maro, Catherine Corey, Jie Li, Sengwee Toh, and Mayura U. Shinde
Subjects: Epidemiology, Pharmacology (medical)
Abstract: The US Food and Drug Administration established the Sentinel System to monitor the safety of medical products. A component of this system includes parameterizable analytic tools to identify mother-infant pairs and evaluate infant outcomes to enable the routine monitoring of the utilization and safety of drugs used in pregnancy. We assessed the feasibility of using the data and tools in the Sentinel System by assessing a known association between topiramate use during pregnancy and oral clefts in the infant. We identified mother-infant pairs using the mother-infant linkage table from six data partners contributing to the Sentinel Distributed Database from January 1, 2000, to September 30, 2015. We compared mother-infant pairs with first-trimester exposure to topiramate to mother-infant pairs that were topiramate-unexposed or lamotrigine-exposed and used a validated algorithm to identify oral clefts in the infant. We estimated adjusted risk ratios through propensity score stratification. There were 2,007 topiramate-exposed and 1,066,086 unexposed mother-infant pairs in the main comparison. In the active-comparator analysis, there were 1,996 topiramate-exposed and 2,859 lamotrigine-exposed mother-infant pairs. After propensity score stratification, the odds ratio for oral clefts was 2.92 (95% CI: 1.43, 5.93) comparing the topiramate-exposed to unexposed groups and 2.72 (95% CI: 0.75, 9.93) comparing the topiramate-exposed to lamotrigine-exposed groups. We found an increased risk of oral clefts after topiramate exposure in the first trimester in the Sentinel database. These results are similar to prior published observational study results and demonstrate the ability of Sentinel's data and analytic tools to assess medical product safety in cohorts of mother-infant pairs in a timely manner.
Published: 2022

11. Assessing the impact of privacy-preserving record linkage on record overlap and patient demographic and clinical characteristics in PCORnet®, the National Patient-Centered Clinical Research Network

Author: Keith Marsolo, Daniel Kiernan, Sengwee Toh, Jasmin Phua, Darcy Louzao, Kevin Haynes, Mark Weiner, Francisco Angulo, Charles Bailey, Jiang Bian, Daniel Fort, Shaun Grannis, Ashok Kumar Krishnamurthy, Vinit Nair, Pedro Rivera, Jonathan Silverstein, Maryan Zirkle, and Thomas Carton
Subjects: Health Informatics
Abstract: Objective This article describes the implementation of a privacy-preserving record linkage (PPRL) solution across PCORnet®, the National Patient-Centered Clinical Research Network. Material and Methods Using a PPRL solution from Datavant, we quantified the degree of patient overlap across the network and report a de-duplicated analysis of the demographic and clinical characteristics of the PCORnet population. Results There were ∼170M patient records across the responding Network Partners, with ∼138M (81%) of those corresponding to a unique patient. 82.1% of patients were found in a single partner and 14.7% were in 2. The percentage overlap between Partners ranged between 0% and 80% with a median of 0%. Linking patients’ electronic health records with claims increased disease prevalence in every clinical characteristic, ranging between 63% and 173%. Discussion The overlap between Partners was variable and depended on timeframe. However, patient data linkage changed the prevalence profile of the PCORnet patient population. Conclusions This project was one of the largest linkage efforts of its kind and demonstrates the potential value of record linkage. Linkage between Partners may be most useful in cases where there is geographic proximity between Partners, an expectation that potential linkage Partners will be able to fill gaps in data, or a longer study timeframe.
Published: 2022

12. Real-World Evidence Comparing Vedolizumab and Ustekinumab in Antitumor Necrosis Factor-Experienced Patients With Crohn's Disease

Author: Michael D Kappelman, Sruthi Adimadhyam, Laura Hou, Audrey E Wolfe, Samantha Smith, Andrew L Simon, Érick Moyneur, Juliane S Reynolds, Sengwee Toh, Angela Dobes, Lauren E Parlett, Kevin Haynes, Mano Selvan, Qianli Ma, Vinit Nair, Jessica Burris, Jennifer E Dorand, Ghadeer K Dawwas, James D Lewis, and Millie D Long
Subjects: Hepatology, Gastroenterology
Abstract: Many patients with Crohn's disease (CD) lose response or become intolerant to antitumor necrosis factor (TNF) therapy and subsequently switch out of class. We compared the effectiveness and safety of ustekinumab to vedolizumab in a large, geographically diverse US population of TNF-experienced patients with CD. We conducted a retrospective cohort study using longitudinal claims data from a large US insurer (Anthem, Inc.). We identified patients with CD initiating vedolizumab or ustekinumab with anti-TNF treatment in the prior 6 months. Our primary outcome was treatment persistence for 52 weeks. Secondary outcomes included (i) all-cause hospitalization, (ii) hospitalization for CD with surgery, (iii) hospitalization for CD without surgery, and (iv) hospitalization for infection. Propensity score fine stratification was used to control for demographic and baseline clinical characteristics and prior treatments. Among 885 new users of ustekinumab and 490 new users of vedolizumab, we observed no difference in treatment persistence (adjusted risk ratio 1.09 [95% confidence interval 0.95-1.25]). Ustekinumab was associated with a lower rate of all-cause hospitalization (adjusted hazard ratio 0.73 [0.59-0.91]), nonsurgical CD hospitalization (adjusted hazard ratio 0.58 [0.40-0.83]), and hospitalization for infection (adjusted hazard ratio 0.56 [0.34-0.92]). This real-world comparative effectiveness study of anti-TNF-experienced patients with CD initiating vedolizumab or ustekinumab showed similar treatment persistence rates beyond 52 weeks, although secondary outcomes such as all-cause hospitalizations, nonsurgical CD hospitalizations, and hospitalizations for infection favored ustekinumab initiation. We, therefore, advocate for individualized decision making in this medically refractory population, considering patient preference and other factors such as cost and route of administration.
Published: 2022

13. High‐dimensional propensity scores for empirical covariate selection in secondary database studies: Planning, implementation, and reporting

Author: Jeremy A. Rassen, Patrick Blin, Sebastian Kloss, Romain S. Neugebauer, Robert W. Platt, Anton Pottegård, Sebastian Schneeweiss, and Sengwee Toh
Subjects: Epidemiology, Pharmacology (medical)
Abstract: Real-world evidence used for regulatory, payer, and clinical decision-making requires principled epidemiology in design and analysis, applying methods to minimize confounding given the lack of randomization. One technique to deal with potential confounding is propensity score (PS) analysis, which allows for the adjustment for measured preexposure covariates. Since its first publication in 2009, the high-dimensional propensity score (hdPS) method has emerged as an approach that extends traditional PS covariate selection to include large numbers of covariates that may reduce confounding bias in the analysis of healthcare databases. hdPS is an automated, data-driven analytic approach for covariate selection that empirically identifies preexposure variables and proxies to include in the PS model. This article provides an overview of the hdPS approach and recommendations on the planning, implementation, and reporting of hdPS used for causal treatment-effect estimations in longitudinal healthcare databases. We supply a checklist with key considerations as a supportive decision tool to aid investigators in the implementation and transparent reporting of hdPS techniques, and to aid decision-makers unfamiliar with hdPS in the understanding and interpretation of studies employing this approach. This article is endorsed by the International Society for Pharmacoepidemiology.
Published: 2022

14. Association between glucagon‐like peptide‐1 receptor agonists and biliary‐related diseases in patients with type 2 diabetes: A nationwide cohort study

Author: Yaa‐Hui Dong, Jo‐Hsuan Wu, Chia‐Hsuin Chang, Jou‐Wei Lin, Li‐Chiu Wu, and Sengwee Toh
Subjects: Cohort Studies, Choledocholithiasis, Diabetes Mellitus, Type 2, Cholangitis, Cholecystitis, Acute, Humans, Hypoglycemic Agents, Female, Pharmacology (medical), Sodium-Glucose Transporter 2 Inhibitors, Glucagon-Like Peptide-1 Receptor, Retrospective Studies
Abstract: Clinical trials have suggested that glucagon-like peptide-1 receptor agonists (GLP-1RAs) may be associated with a higher risk of biliary-related diseases in patients with type 2 diabetes. Limited real-world studies have examined the comparative biliary safety of GLP-1RAs versus other antihyperglycemic drugs. We aimed to estimate the comparative risk of biliary-related diseases between GLP-1RAs and sodium glucose cotransporter 2 inhibitors (SGLT2is), which are indicated for patients with similar diabetes severity in Taiwan. Retrospective cohort study. Taiwan National Health Insurance Database during 2011 to 2018. Patients with type 2 diabetes who initiated GLP-1RAs or SGLT2is. GLP-1RAs versus SGLT2is. We used an on-treatment approach to examine the effect of continuous use and an intention-to-treat approach to assess the effect of initiation of GLP-1RAs versus SGLT2is. We used Cox regression models to estimate the hazard ratios (HRs) and 95% confidence intervals (CIs) for the composite hospitalized biliary-related diseases, including acute cholecystitis or cholecystectomy, choledocholithiasis, and acute cholangitis, after matching each GLP-1RA initiator to up to 10 SGLT2i initiators using propensity scores (PSs). Among 78,253 PS-matched patients, GLP-1RA use was associated with a numerically higher risk of biliary-related diseases versus SGLT2i use in the on-treatment analysis, with an HR of 1.20 (95% CI, 0.93-1.56) for the composite outcome, an HR of 1.22 (95% CI, 0.92-1.62) for acute cholecystitis or cholecystectomy, an HR of 1.20 (95% CI, 0.69-2.07) for choledocholithiasis, and an HR of 1.14 (95% CI, 0.82-2.42) for acute cholangitis. The HRs were more pronounced in the intention-to-treat analysis (1.27 [95% CI, 1.05-1.53] for the composite outcome, 1.29 [95% CI, 1.04-1.58] for acute cholecystitis or cholecystectomy, 1.74 [95% CI, 1.23-2.46] for choledocholithiasis, and 1.31 [95% CI, 0.89-1.94] for acute cholangitis).
The increased risk of the composite outcome associated with GLP-1RAs was more evident in patients aged >60 years, women, and 120 days after treatment initiation. Liraglutide, but not dulaglutide, was associated with an elevated risk. GLP-1RAs might be associated with an elevated risk of biliary-related diseases compared to SGLT2is in Asian patients with type 2 diabetes.
Published: 2022

15. Use of Linked Databases for Improved Confounding Control: Considerations for Potential Selection Bias

Author: Jenny W Sun, Rui Wang, Dongdong Li, and Sengwee Toh
Subjects: Cohort Studies, Adolescent, Bias, Diabetes Mellitus, Type 2, Practice of Epidemiology, Epidemiology, Pharmacoepidemiology, Humans, Selection Bias
Abstract: Pharmacoepidemiologic studies are increasingly conducted within linked databases, often to obtain richer confounder data. However, the potential for selection bias is frequently overlooked when linked data is available only for a subset of patients. We highlight the importance of accounting for potential selection bias by evaluating the association between antipsychotics and type 2 diabetes in youths within a claims database linked to a smaller laboratory database. We used inverse probability of treatment weights (IPTW) to control for confounding. In analyses restricted to the linked cohorts, we applied inverse probability of selection weights (IPSW) to create a population representative of the full cohort. We used pooled logistic regression weighted by IPTW only or IPTW and IPSW to estimate treatment effects. Metabolic conditions were more prevalent in linked cohorts compared with the full cohort. Within the full cohort, the confounding-adjusted hazard ratio was 2.26 (95% CI: 2.07, 2.49) comparing initiation of antipsychotics with initiation of control medications. Within the linked cohorts, a different magnitude of association was obtained without adjustment for selection, whereas applying IPSW resulted in point estimates similar to the full cohort’s (e.g., an adjusted hazard ratio of 1.63 became 2.12). Linked database studies may generate biased estimates without proper adjustment for potential selection bias.
Published: 2022

16. Validation of diagnosis codes to identify hospitalized COVID-19 patients in health care claims data

Author: Sheryl A. Kluberg, Laura Hou, Sarah K. Dutcher, Monisha Billings, Brian Kit, Sengwee Toh, Sascha Dublin, Kevin Haynes, Annemarie Kline, Mahesh Maiyani, Pamala A. Pawloski, Eric S. Watson, and Noelle M. Cocoros
Subjects: COVID-19 Testing, Databases, Factual, International Classification of Diseases, SARS-CoV-2, Epidemiology, COVID-19, Humans, Pharmacology (medical), Delivery of Health Care, Algorithms
Abstract: Health plan claims may provide complete longitudinal data for timely, real-world population-level COVID-19 assessment. However, these data often lack laboratory results, the standard for COVID-19 diagnosis. We assessed the validity of ICD-10-CM diagnosis codes for identifying patients hospitalized with COVID-19 in U.S. claims databases, compared to linked laboratory results, among six Food and Drug Administration Sentinel System data partners (two large national insurers, four integrated delivery systems) from February 20-October 17, 2020. We identified patients hospitalized with COVID-19 according to five ICD-10-CM diagnosis code-based algorithms, which included combinations of codes U07.1, B97.29, general coronavirus codes, and diagnosis codes for severe symptoms. We calculated the positive predictive value (PPV) and sensitivity of each algorithm relative to laboratory test results. We stratified results by data source type and across three time periods: February 20-March 31 (Time A), April 1-30 (Time B), May 1-October 17 (Time C). The five algorithms identified between 34 806 and 47 293 patients across the study periods; 23% with known laboratory results contributed to PPV calculations. PPVs were high and similar across algorithms. PPV of U07.1 alone was stable around 93% for integrated delivery systems, but declined over time from 93% to 70% among national insurers. Overall PPV of U07.1 across all data partners was 94.1% (95% CI, 92.3%-95.5%) in Time A and 81.2% (95% CI, 80.1%-82.2%) in Time C. Sensitivity was consistent across algorithms and over time, at 94.9% (95% CI, 94.2%-95.5%). Our results support the use of code U07.1 to identify hospitalized COVID-19 patients in U.S. claims data.
Published: 2022

17. The US Food and Drug Administration Sentinel System: a national resource for a learning health system

Author: Jeffrey S Brown, Aaron B Mendelsohn, Young Hee Nam, Judith C Maro, Noelle M Cocoros, Carla Rodriguez-Watson, Catherine M Lockhart, Richard Platt, Robert Ball, Gerald J Dal Pan, and Sengwee Toh
Subjects: Pharmaceutical Preparations, United States Food and Drug Administration, Health Informatics, Learning Health System, United States, Perspectives
Abstract: The US Food and Drug Administration (FDA) created the Sentinel System in response to a requirement in the FDA Amendments Act of 2007 that the agency establish a system for monitoring risks associated with drug and biologic products using data from disparate sources. The Sentinel System has completed hundreds of analyses, including many that have directly informed regulatory decisions. The Sentinel System also was designed to support a national infrastructure for a learning health system. Sentinel governance and guiding principles were designed to facilitate Sentinel’s role as a national resource. The Sentinel System infrastructure now supports multiple non-FDA projects for stakeholders ranging from regulated industry to other federal agencies, international regulators, and academics. The Sentinel System is a working example of a learning health system that is expanding with the potential to create a global learning health system that can support medical product safety assessments and other research.
Published: 2022

18. The Authors’ Reply: Statins and post-stroke dementia

Author: Zhirong Yang, Sengwee Toh, and Jonathan Mant
Subjects: Epidemiology
Published: 2023

19. Meta-analysis with sample-standardization in multi-site studies

Author: Di Shu, Michael Webster‐Clark, Robert W. Platt, and Sengwee Toh
Subjects: Causality, Models, Statistical, Epidemiology, Humans, Pharmacology (medical), Computer Simulation, Reference Standards
Abstract: To conceptualize a particular target population and estimand for multi-site pharmacoepidemiologic studies within data networks and to analytically examine sample-standardization as a meta-analytic method compared with inverse-variance weighted meta-analyses. The target population of interest is all and only all individuals from the data-contributing sites. Standardization, a general conditioning technique frequently employed for confounding control, was adopted to estimate the network-wide causal treatment effect. Specifically, the proposed sample-standardization yields a meta-analysis estimator, that is, a weighted summation of site-specific results, where the weight for a site is the proportion of its size in the entire network. This sample-standardization estimator was evaluated analytically in comparison to estimators from inverse-variance weighted fixed-effect and random-effects meta-analyses in terms of statistical consistency. A proof is reported to justify the consistency of the sample-standardization estimator with and without treatment effect heterogeneity by site. Both inverse-variance weighted fixed-effect and random-effects meta-analyses were found to generally result in inconsistent estimators in the presence of treatment effect heterogeneity by site for this particular target population and estimand. Sample-standardization is a valid approach to generate causal inference in multi-site studies when the target population comprises all and only all individuals within the network, even in the presence of heterogeneity of treatment effect by site. Multi-site studies should clearly specify the target population and estimand to help select the most appropriate meta-analytic methods.
Published: 2022
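
Illustrative note on record 19: the abstract describes the sample-standardization estimator as a weighted sum of site-specific results, with each site weighted by its share of the network. A minimal sketch of that arithmetic follows, set beside a conventional inverse-variance fixed-effect pooling for contrast. The three sites, their estimates, sizes, and variances are invented for illustration and do not come from the paper; the function names are hypothetical, and the estimates are assumed to be on an additive scale such as a risk difference.

```python
from typing import Sequence


def sample_standardized(estimates: Sequence[float], sizes: Sequence[int]) -> float:
    """Weight each site-specific estimate by the site's share of the network."""
    if not estimates or len(estimates) != len(sizes):
        raise ValueError("estimates and sizes must be non-empty and equal length")
    total = sum(sizes)
    return sum((n / total) * est for est, n in zip(estimates, sizes))


def inverse_variance_fixed_effect(
    estimates: Sequence[float], variances: Sequence[float]
) -> float:
    """Conventional fixed-effect pooling: weights are 1 / variance."""
    if not estimates or len(estimates) != len(variances):
        raise ValueError("estimates and variances must be non-empty and equal length")
    weights = [1.0 / v for v in variances]
    return sum(w * e for w, e in zip(weights, estimates)) / sum(weights)


# Hypothetical three-site network (all numbers are made up for illustration).
site_estimates = [0.10, 0.20, 0.40]     # assumed site-specific risk differences
site_sizes = [5000, 3000, 2000]         # assumed number of individuals per site
site_variances = [0.004, 0.001, 0.002]  # assumed variances of the estimates

pooled_ss = sample_standardized(site_estimates, site_sizes)
pooled_iv = inverse_variance_fixed_effect(site_estimates, site_variances)
print(f"sample-standardized: {pooled_ss:.4f}")
print(f"inverse-variance:    {pooled_iv:.4f}")
```

With heterogeneous site effects, the two pooled values differ because the weights answer different questions: site size targets the whole network population, whereas inverse variance targets precision.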

20. Early Life Antibiotic Prescriptions and Weight Outcomes in Children 10 Years of Age

Author: Christopher B. Forrest, William J. Heerman, L. Charles Bailey, Melanie Jay, Ihuoma Eneli, Jessica G. Young, Casie Horgan, Daniel S. Hsia, Juliane S. Reynolds, Goutham Rao, Leonardo Trasande, PCORnet Antibiotics, Sheryl L. Rifas-Shiman, Sengwee Toh, Pi-I D. Lin, Jessica L. Sturtevant, Jason P. Block, Doug Lunsford, Matthew F. Daley, and Jonathan A. Finkelstein
Subjects: Pediatrics, medicine.medical_specialty, medicine.drug_class, Antibiotics, Body Mass Index, 03 medical and health sciences, 0302 clinical medicine, 030225 pediatrics, medicine, Humans, Obesity, 030212 general & internal medicine, Antibiotic use, Medical prescription, Child, business.industry, Overweight, medicine.disease, Early life, Anti-Bacterial Agents, Prescriptions, Clinical research, Child, Preschool, Pediatrics, Perinatology and Child Health, business, Body mass index, Healthcare system
Abstract: We previously found that antibiotic use at <24 months of age was associated with slightly higher body weight at 5 years of age. In this study, we examine associations of early life antibiotic prescriptions with weight outcomes at 108 to 132 months of age ("10 years"). We used electronic health record data from 2009 through 2016 from 10 health systems in PCORnet, a national distributed clinical research network. We examined associations of any (vs no) antibiotics at <24 months of age with body mass index z-score (BMI-z) at 10 years adjusted for confounders selected a priori. We further examined dose response (number of antibiotic episodes) and antibiotic spectrum (narrow and broad). Among 56,727 included children, 57% received any antibiotics at <24 months; at 10 years, mean (standard deviation) BMI-z was 0.54 (1.14), and 36% had overweight or obesity. Any versus no antibiotic use at <24 months was associated with a slightly higher BMI-z at 10 years among children without a complex chronic condition (β 0.03; 95% confidence interval [CI] 0.01, 0.05) or with a complex chronic condition (β 0.09; 95% CI 0.03, 0.15). Any versus no antibiotic use was not associated with odds of overweight or obesity at 10 years among children without (odds ratio 1.02; 95% CI 0.97, 1.07) or with a complex chronic condition (odds ratio 1.07; 95% CI 0.96, 1.19). The small and likely clinically insignificant associations in this study are consistent with our previous 5-year follow-up results, suggesting that, if this relationship is indeed causal, early increases in weight are small but maintained over time.
Published: 2021

21. Estimating the marginal hazard ratio by simultaneously using a set of propensity score models: A multiply robust approach

Author: Di Shu, Peisong Han, Rui Wang, and Sengwee Toh
Subjects: Statistics and Probability, Models, Statistical, Epidemiology, Proportional hazards model, Computer science, Inverse probability weighting, Hazard ratio, Estimator, Inverse probability, Empirical likelihood, Research Design, Consistency (statistics), Propensity score matching, Statistics, Humans, Computer Simulation, Propensity Score, Proportional Hazards Models
Abstract: The inverse probability weighted Cox model is frequently used to estimate the marginal hazard ratio. Its validity requires a crucial condition that the propensity score model be correctly specified. To provide protection against misspecification of the propensity score model, we propose a weighted estimation method rooted in the empirical likelihood theory. The proposed estimator is multiply robust in that it is guaranteed to be consistent when a set of postulated propensity score models contains a correctly specified model. Our simulation studies demonstrate satisfactory finite sample performance of the proposed method in terms of consistency and efficiency. We apply the proposed method to compare the risk of postoperative hospitalization between sleeve gastrectomy and Roux-en-Y gastric bypass using data from a large medical claims and billing database. We further extend the development to multisite studies to enable each site to postulate multiple site-specific propensity score models.
Published: 2021

22. Invited Commentary: Go BIG and Go Global—Executing Large-Scale, Multisite Pharmacoepidemiologic Studies Using Real-World Data

Author: Judith C Maro and Sengwee Toh
Subjects: Databases, Factual, Epidemiology, Invited Commentary, Pharmacoepidemiology, Humans
Abstract: At the time medical products are approved, we rarely know enough about their comparative safety and effectiveness vis-à-vis alternative therapies to advise patients and providers. Postmarket generation of evidence on rare adverse events following medical product exposure increasingly requires analysis of millions of longitudinal patient records that can provide complete capture of data on patient experiences. In the accompanying article by Pradhan et al. (Am J Epidemiology. 2022;191(8):1352–1367), the authors demonstrate how observational database studies are often the most practical approach, provided these databases are carefully chosen to be “fit for purpose.” Distributed data networks with common data models have proliferated in the last 2 decades in pharmacoepidemiology, allowing efficient capture of patient data in a standardized and structured format across disparate real-world data sources. Use of common data models facilitates transparency by allowing standardized programming approaches that can be easily reproduced. The distributed data network architecture, combined with a common data approach, supports not only multisite observational studies but also pragmatic clinical trials. It also helps bridge international boundaries and further increases the sample size and diversity of study populations.
Published: 2022

23. Profiling real-world data sources for pharmacoepidemiologic research: A call for papers

Author: Xiaojuan Li, Vincent Lo Re, and Sengwee Toh
Subjects: Epidemiology, Pharmacoepidemiology, Humans, Information Storage and Retrieval, Pharmacology (medical)
Published: 2022

24. Using the IMEDS distributed database for epidemiological studies in type 2 diabetes mellitus

Author: Ting-Ying Huang, Carla Rodriguez-Watson, Tongtong Wang, Shawna R Calhoun, James Marshall, Jillian Burk, Young Hee Nam, Aaron B Mendelsohn, Aziza Jamal-Allial, Robert T Greenlee, Mano Selvan, Pamala A Pawloski, Cheryl N McMahill Walraven, Ashish Rai, Sengwee Toh, and Jeffery S Brown
Subjects: Adult, Male, Diabetes Mellitus, Type 2, Endocrinology, Diabetes and Metabolism, Humans, Hypoglycemic Agents, Insulin, Female, Metformin, Aged, Retrospective Studies
Abstract: Introduction This study aimed to assess data relevancy and data quality of the Innovation in Medical Evidence Development and Surveillance System Distributed Database (IMEDS-DD) for diabetes research and to evaluate comparability of its type 2 diabetes cohort to the general type 2 diabetes population. Research design and methods A retrospective study was conducted using the IMEDS-DD. Eligible members were adults with a medical encounter between April 1, 2018 and March 31, 2019 (index period). Type 2 diabetes and co-existing conditions were determined using all data available from April 1, 2016 to the most recent encounter within the index period. Type 2 diabetes patient characteristics, comorbidities and hemoglobin A1c (HbA1c) values were summarized and compared with those reported in national benchmarks and literature. Results Type 2 diabetes prevalence was 12.6% in the IMEDS-DD. Of 414 672 patients with type 2 diabetes, 52.8% were male, and the mean age was 65.0 (SD 13.3) years. Common comorbidities included hypertension (84.5%), hyperlipidemia (82.8%), obesity (45.3%), and cardiovascular disease (44.7%). Moderate-to-severe chronic kidney disease was observed in 20.2% patients. The most commonly used antihyperglycemic agents included metformin (35.7%), sulfonylureas (14.8%), and insulin (9.9%). Less than one-half (48.9%) had an HbA1c value recorded. These findings demonstrated the notable similarity in patient characteristics between type 2 diabetes populations identified within the IMEDS-DD and other large databases. Conclusions Despite the limitations related to HbA1c data, our findings indicate that the IMEDS-DD contains robust information on key data elements to conduct pharmacoepidemiological studies in diabetes, including member demographic and clinical characteristics and health services utilization.
Published: 2022

25. Comparative cardiovascular effectiveness of glucagon-like peptide-1 receptor agonists versus sodium-glucose cotransporter-2 inhibitors in patients with type 2 diabetes: A population-based cohort study

Author: Yaa‐Hui Dong, Chia‐Hsuin Chang, Jou‐Wei Lin, Wei‐Shun Yang, Li‐Chiu Wu, and Sengwee Toh
Subjects: Endocrinology, Diabetes and Metabolism, Sodium, Myocardial Infarction, Glucagon-Like Peptide-1 Receptor, Brain Ischemia, Cohort Studies, Stroke, Hemorrhagic Stroke, Endocrinology, Glucose, Diabetes Mellitus, Type 2, Cardiovascular Diseases, Internal Medicine, Humans, Hypoglycemic Agents, Renal Insufficiency, Chronic, Sodium-Glucose Transporter 2 Inhibitors, Ischemic Stroke
Abstract: To examine the comparative effectiveness of glucagon-like peptide-1 receptor agonists (GLP-1RAs) and sodium-glucose cotransporter-2 (SGLT2) inhibitors for select cardiovascular outcomes and to examine whether the relative risks varied across different patient subgroups in patients with type 2 diabetes. We conducted a nationwide cohort study of patients with type 2 diabetes who initiated GLP-1RAs or SGLT2 inhibitors between 2012 and 2018 in Taiwan. The study outcomes included myocardial infarction and total stroke, further classified into ischaemic or haemorrhagic stroke. We estimated the hazard ratios (HRs) and 95% confidence intervals (CIs) for each outcome, comparing GLP-1RAs with SGLT2 inhibitors using Cox proportional hazards models after 1:1 propensity-score (PS) matching. We also examined if there was effect modification by age, underlying chronic kidney disease, or coexisting cardiovascular disease in prespecified subgroup analyses. Among 26 032 PS-matched patients, GLP-1RA initiators and SGLT2 inhibitor initiators showed similar risks of myocardial infarction (HR 0.99, 95% CI 0.65-1.52), total stroke (HR 0.90, 95% CI 0.69-1.17), ischaemic stroke (HR 0.86, 95% CI 0.65-1.14) and haemorrhagic stroke (HR 0.88, 95% CI 0.63-1.25). However, GLP-1RA treatment was associated with an increased risk of total stroke (HR 1.76, 95% CI 1.06-2.94) and ischaemic stroke (HR 1.88, 95% CI 1.09-3.23) among patients with chronic kidney disease, but not among patients without chronic kidney disease. GLP-1RA therapy seemed to have a lower risk of haemorrhagic stroke among patients with cardiovascular disease (HR 0.64, 95% CI 0.43-0.97), but not in patients without cardiovascular disease. Glucagon-like peptide-1 receptor agonists and SGLT2 inhibitors appeared to have comparable effectiveness with regard to several cardiovascular outcomes overall, but their comparative effectiveness may vary in certain patient subgroups.
Published: 2022

26. Real-World Evidence Comparing Vedolizumab and Ustekinumab in Antitumor Necrosis Factor-Experienced Patients With Crohn’s Disease.

Author: Kappelman, Michael D., Adimadhyam, Sruthi, Hou, Laura, Wolfe, Audrey E., Smith, Samantha, Simon, Andrew L., Moyneur, Erick, Reynolds, Juliane S., Sengwee Toh, Dobes, Angela, Parlett, Lauren E., Haynes, Kevin, Selvan, Mano, Qianli Ma, Nair, Vinit, Burris, Jessica, Dorand, Jennifer E., Dawwas, Ghadeer K., Lewis, James D., and Long, Millie D.
Published: 2023

27. Building an active medical product safety surveillance system in Taiwan: Adaptation of the U.S. Sentinel System common data model structure to the National Health Insurance Research Database in Taiwan

Author: Huang Tz Ou, Kelly Huang, Chi-Chuan Wang, Fang-Ju Lin, Chien Ning Hsu, Sengwee Toh, and Ling Ya Huang
Subjects: Resource (biology), Databases, Factual, National Health Programs, Epidemiology, business.industry, Taiwan, Sample (statistics), Pharmacoepidemiology, Pharmacovigilance, Data model, Environmental health, Data quality, Cohort, Electronic Health Records, Humans, Medicine, Pharmacology (medical), business, Cause of death
Abstract: PURPOSE Using real-world data to support regulatory decision has become a global movement. However, a robust platform for active surveillance of medical product safety has not been established in Taiwan. METHODS Following the common data model structure of the U.S. Food and Drug Administration's Sentinel System, we built the Taiwan Sentinel Data Model (TSDM) using the National Health Insurance Research Database with longitudinal claims data from 23 million individuals, linked death and cause of death data from a national registry, and linked electronic health record data from a delivery system. We examined the conversion of the TSDM using the Sentinel Data Quality Review and Characterization Programs in a sample of sex- and age-stratified cohort of 3 million individuals. RESULTS The TSDM fulfilled the requirements of data quality assurance. Only about 6% of sex and 0.0007% of birth year were missing, and
Published: 2020

28. The Certainty Framework for Assessing Real‐World Data in Studies of Medical Product Safety and Effectiveness

Author: Nancy A Dreyer, Miriam C. J. M. Sturkenboom, Sengwee Toh, Noelle M. Cocoros, Solomon Iyasu, Chieko Ishiguro, Wei Zhou, and Peter Arlett
Subjects: Data Analysis, Research design, Computer science, media_common.quotation_subject, Information Storage and Retrieval, Context (language use), Antiviral Agents, 030226 pharmacology & pharmacy, Outcome (game theory), 03 medical and health sciences, Oseltamivir, 0302 clinical medicine, Rivaroxaban, Influenza, Human, Humans, Pharmacology (medical), media_common, Pharmacology, Confounding, Certainty, Variable (computer science), Range (mathematics), Risk analysis (engineering), Research Design, Medical product, 030220 oncology & carcinogenesis, Algorithms
Abstract: A fundamental question in using real-world data for clinical and regulatory decision making is: How certain must we be that the algorithm used to capture an exposure, outcome, cohort-defining characteristic, or confounder is what we intend it to be? We provide a practical framework to help researchers and regulators assess and classify the fit-for-purposefulness of real-world data by study variable for a range of data sources. The three levels of certainty (optimal, sufficient, and probable) must be considered in the context of each study variable, the specific question being studied, the study design, and the decision at hand.
Published: 2020

29. Variance estimation in inverse probability weighted Cox models

Author: Di Shu, Sengwee Toh, Jessica G. Young, and Rui Wang
Subjects: Statistics and Probability, Independent and identically distributed random variables, Estimating equations, 01 natural sciences, General Biochemistry, Genetics and Molecular Biology, 010104 statistics & probability, 03 medical and health sciences, Linearization, Statistics, Computer Simulation, 0101 mathematics, Equivalence (measure theory), Probability, Proportional Hazards Models, 030304 developmental biology, Mathematics, 0303 health sciences, Models, Statistical, General Immunology and Microbiology, Applied Mathematics, Inverse probability weighting, Estimator, General Medicine, Variance (accounting), Inverse probability, Research Design, General Agricultural and Biological Sciences
Abstract: Inverse probability weighted Cox models can be used to estimate marginal hazard ratios under different point treatments in observational studies. To obtain variance estimates, the robust sandwich variance estimator is often recommended to account for the induced correlation among weighted observations. However, this estimator does not incorporate the uncertainty in estimating the weights and tends to overestimate the variance, leading to inefficient inference. Here we propose a new variance estimator that combines the estimation procedures for the hazard ratio and weights using stacked estimating equations, with additional adjustments for the sum of terms that are not independently and identically distributed in a Cox partial likelihood score equation. We prove analytically that the robust sandwich variance estimator is conservative and establish the asymptotic equivalence between the proposed variance estimator and one obtained through linearization by Hajage et al. in 2018. In addition, we extend our proposed variance estimator to accommodate clustered data. We compare the finite sample performance of the proposed method with alternative methods through simulation studies. We illustrate these different variance methods in both independent and clustered data settings, using a bariatric surgery dataset and a multiple readmission dataset, respectively. To facilitate implementation of the proposed method, we have developed an R package ipwCoxCSV.
Published: 2020

30. Consequences of Depletion of Susceptibles for Hazard Ratio Estimators Based on Propensity Scores

Author: Robert J. Wellman, Sebastian Schneeweiss, Laura B. Amsden, Richard Wyss, Judith C. Maro, Joshua J. Gagne, Zilu Zhang, Jessica M. Franklin, Bruce Fireman, Jennifer C. Nelson, Sengwee Toh, Catherine Rogers Murray, and Susan Gruber
Subjects: Propensity score, Epidemiology, Hazard ratio, 01 natural sciences, Cohort Studies, 010104 statistics & probability, 03 medical and health sciences, Treatment status, 0302 clinical medicine, Bias, Statistics, Methods, Depletion of susceptibles, Humans, 030212 general & internal medicine, 0101 mathematics, Mathematics, Survivor bias, Proportional Hazards Models, Proportional hazards model, Confounding, Null (mathematics), Estimator, Survival analysis, Relative risk, Propensity score matching, Noncollapsibility, ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
Abstract: We use simulated data to examine the consequences of depletion of susceptibles for hazard ratio (HR) estimators based on a propensity score (PS). First, we show that the depletion of susceptibles attenuates marginal HRs toward the null by amounts that increase with the incidence of the outcome, the variance of susceptibility, and the impact of susceptibility on the outcome. If susceptibility is binary then the Bross bias multiplier, originally intended to quantify bias in a risk ratio from a binary confounder, also quantifies the ratio of the instantaneous marginal HR to the conditional HR as susceptibles are depleted differentially. Second, we show how HR estimates that are conditioned on a PS tend to be between the true conditional and marginal HRs, closer to the conditional HR if treatment status is strongly associated with susceptibility and closer to the marginal HR if treatment status is weakly associated with susceptibility. We show that associations of susceptibility with the PS matter to the marginal HR in the treated (ATT) though not to the marginal HR in the entire cohort (ATE). Third, we show how the PS can be updated periodically to reduce depletion-of-susceptibles bias in conditional estimators. Although marginal estimators can hit their ATE or ATT targets consistently without updating the PS, we show how their targets themselves can be misleading as they are attenuated toward the null. Finally, we discuss implications for the interpretation of HRs and their relevance to underlying scientific and clinical questions. See video abstract: http://links.lww.com/EDE/B727.
Published: 2020

31. Conducting prospective sequential surveillance in real‐world dynamic distributed databases

Author: Judith C. Maro, Michael D. Nguyen, Laura Hou, Efe Eworuke, Sengwee Toh, Rima Izem, Bruce Fireman, Emily C. Welch, Joo‐Yeon Lee, and Margie R. Goulding
Subjects: Data collection, Information retrieval, Databases, Factual, Distributed database, United States Food and Drug Administration, Epidemiology, business.industry, Data Collection, MEDLINE, Datasets as Topic, Pilot Projects, United States, Computer Communication Networks, Propensity score matching, Product Surveillance, Postmarketing, Humans, Medicine, Pharmacology (medical), Prospective Studies, Propensity Score, business, Drug Approval, Decision Making, Organizational
Published: 2020

32. Translating claims‐based CHA2DS2‐VaSc and HAS‐BLED to ICD‐10‐CM: Impacts of mapping strategies

Author: Michael Webster-Clark, Ting-Ying Huang, Sengwee Toh, and Laura Hou
Subjects: HAS-BLED, medicine.medical_specialty, Epidemiology, business.industry, medicine.drug_class, Anticoagulant, Warfarin, ICD-10, Atrial fibrillation, medicine.disease, 030226 pharmacology & pharmacy, Confidence interval, 03 medical and health sciences, 0302 clinical medicine, Internal medicine, Cardiology, Medicine, Pharmacology (medical), 030212 general & internal medicine, Diagnosis code, business, Stroke, medicine.drug
Abstract: Purpose The CHA2DS2-VaSc and HAS-BLED risk scores are commonly used in studies of oral anticoagulants (OACs). The best ways to map these scores to the International Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) codes are unclear, as is how they perform in various types of OAC users. We aimed to assess the distributions of CHA2DS2-VaSc and HAS-BLED scores and C-statistics for outcome prediction in the ICD-10-CM era using different mapping strategies. Methods We compared the distributions of CHA2DS2-VaSc and HAS-BLED scores from various mapping strategies in atrial fibrillation patients before, during, and after ICD-10-CM transition. We estimated the C-statistics predicting the 90-day risk of hospitalized stroke (for CHA2DS2-VaSc) or hospitalized bleeding (for HAS-BLED) in patients identified at least 6 months after the ICD-10-CM transition, overall and by anticoagulant type. Results Forward-backward mapping produced higher CHA2DS2-VaSc and HAS-BLED scores in the ICD-10-CM era compared to the ICD-9-CM era: the mean difference was 0.074 (95% confidence interval 0.064-0.085) for CHA2DS2-VaSc and 0.055 (0.048-0.062) for HAS-BLED. Both scores had higher C-statistics in patients taking no OACs (0.697 [0.677-0.717] for CHA2DS2-VaSc; 0.719 [0.702-0.737] for HAS-BLED) or direct OACs (0.695 [0.654-0.735] for CHA2DS2-VaSc; 0.700 [0.673-0.728] for HAS-BLED) than those taking warfarin (0.655 [0.613-0.697] for CHA2DS2-VaSc; 0.663 [0.632-0.695] for HAS-BLED). Conclusions Existing mapping strategies generally preserved the distributions of CHA2DS2-VaSc and HAS-BLED scores after ICD-10-CM transition. Both scores performed better in patients on no OACs or direct OACs than patients on warfarin.
Published: 2020

33. Use of real‐world evidence in regulatory decisions for rare diseases in the United States—Current status and future directions

Author: Cunlin Wang, Sengwee Toh, Federica Edith Pisa, Larry Bauer, and Jasmanda Wu
Subjects: Orphan Drug Production, Epidemiology, media_common.quotation_subject, Decision Making, 030226 pharmacology & pharmacy, law.invention, Orphan drug, 03 medical and health sciences, Rare Diseases, 0302 clinical medicine, Drug Development, Randomized controlled trial, law, Patient experience, Humans, Medicine, Pharmacology (medical), 030212 general & internal medicine, Baseline (configuration management), Drug Approval, Selection Bias, Randomized Controlled Trials as Topic, media_common, Selection bias, Data collection, United States Food and Drug Administration, business.industry, Medical record, Confounding Factors, Epidemiologic, United States, Drug development, Risk analysis (engineering), Research Design, business
Abstract: Following the release of the framework for the Real-World Evidence (RWE) Program, the US Food and Drug Administration (FDA) is actively evaluating and exploring ways to optimize the utility of real-world data (RWD) and RWE to support regulatory decision making. For rare conditions, conducting traditional randomized clinical trials may not always be feasible, and RWD and RWE have played and will continue to play an important role. We use three case examples (cerliponase alfa, asfotase alfa, and uridine triacetate) to illustrate how RWD from disease registries, medical records with chart review, and literature, respectively, have been used to generate RWE to support regulatory decisions for selected rare diseases. These examples highlight the need for improving data reliability and quality in existing data to expand use of RWD and RWE beyond "hard endpoints" and standardizing data collection for outcome measures in patient registries to expand its utility. We also discuss a recent FDA guidance for using RWE in supporting rare disease drug development, including its recommendations about using natural history studies as external control groups for single-arm interventional trials. The external control group needs to be comparable with the treated group. Selection bias and confounding are major concerns because of lack of randomization and unrecognized baseline differences. Use of valid epidemiological approaches can reduce these biases. Lastly, we discuss future directions to expand the use of RWD and RWE to support orphan drug approvals, including the need for including patient experience data as an important source of RWD.
Published: 2020

34. Analytic and Data Sharing Options in Real‐World Multidatabase Studies of Comparative Effectiveness and Safety of Medical Products

Author: Sengwee Toh
Subjects: Pharmacology, Data collection, Databases, Factual, Information Dissemination, Computer science, Data Collection, Pooling, MEDLINE, 030226 pharmacology & pharmacy, Outcome (game theory), Automatic summarization, Article, Data sharing, 03 medical and health sciences, Treatment Outcome, 0302 clinical medicine, Equipment and Supplies, Risk analysis (engineering), Data Interpretation, Statistical, 030220 oncology & carcinogenesis, Covariate, Key (cryptography), Humans, Pharmacology (medical)
Abstract: A wide range of analytic and data sharing options are available in non-experimental multi-database studies designed to assess the real-world benefits and risks of medical products. Researchers often consider six scientific domains when choosing among these options – study design, exposure type, outcome type, covariate summarization technique, covariate adjustment method, and data sharing approach. This article reviews available analytic and data sharing options and discusses key scientific and practical considerations when choosing among these options in multi-database studies of comparative effectiveness and safety of medical products. The scientific considerations must be balanced against what the data-contributing sites are able or willing to share. While pooling of person-level datasets remains the most familiar and analytically flexible approach, newer analytic and data sharing approaches that share less granular summary-level information may be equally valid and preferred in some multi-database studies, especially when sharing of person-level data is challenging or infeasible.
Published: 2020

35. Applying Machine Learning in Distributed Data Networks for Pharmacoepidemiologic and Pharmacovigilance Studies: Opportunities, Challenges, and Considerations

Author: Jenna Wong, Daniel Prieto-Alhambra, Peter R. Rijnbeek, Rishi J. Desai, Jenna M. Reps, and Sengwee Toh
Subjects: Pharmacology, Machine Learning, Pharmacoepidemiologic, Pharmacovigilance, SDG 3 - Good Health and Well-being, Databases, Factual, Pharmacoepidemiology, Humans, Pharmacology (medical), Toxicology, Distributed Data Network
Abstract: Increasing availability of electronic health databases capturing real-world experiences with medical products has garnered much interest in their use for pharmacoepidemiologic and pharmacovigilance studies. The traditional practice of having numerous groups use single databases to accomplish similar tasks and address common questions about medical products can be made more efficient through well-coordinated multi-database studies, greatly facilitated through distributed data network (DDN) architectures. Access to larger amounts of electronic health data within DDNs has created a growing interest in using data-adaptive machine learning (ML) techniques that can automatically model complex associations in high-dimensional data with minimal human guidance. However, the siloed storage and diverse nature of the databases in DDNs create unique challenges for using ML. In this paper, we discuss opportunities, challenges, and considerations for applying ML in DDNs for pharmacoepidemiologic and pharmacovigilance studies. We first discuss major types of activities performed by DDNs and how ML may be used. Next, we discuss practical data-related factors influencing how DDNs work in practice. We then combine these discussions and jointly consider how opportunities for ML are affected by practical data-related factors for DDNs, leading to several challenges. We present different approaches for addressing these challenges and highlight efforts that real-world DDNs have taken or are currently taking to help mitigate them. Despite these challenges, the time is ripe for the emerging interest to use ML in DDNs, and the utility of these data-adaptive modeling techniques in pharmacoepidemiologic and pharmacovigilance studies will likely continue to increase in the coming years.
Published: 2022

36. Utility of fertility procedures and prenatal tests to estimate gestational age for live-births and stillbirths in electronic health plan databases

Author: Lockwood G. Taylor, Steven T. Bird, Danijela Stojanovic, Sengwee Toh, Judith C. Maro, Elnara Fazio‐Eynullayeva, Andrew B. Petrone, Rajani Rajbhandari, Susan E. Andrade, Kevin Haynes, Cheryl N. McMahill‐Walraven, Mayura Shinde, and Jennifer G. Lyons
Subjects: Fertility, Epidemiology, Pregnancy, Humans, Pharmacology (medical), Female, Gestational Age, Electronics, Stillbirth, Live Birth
Abstract: Current algorithms to evaluate gestational age (GA) during pregnancy rely on hospital coding at delivery and are not applicable to non-live births. We developed an algorithm using fertility procedures and fertility tests, without relying on delivery coding, to develop a novel GA algorithm in live-births and stillbirths. Three pregnancy cohorts were identified from 16 health-plans in the Sentinel System: 1) hospital admissions for live-birth, 2) hospital admissions for stillbirth, and 3) medical chart-confirmed stillbirths. Fertility procedures and prenatal tests, recommended within specific GA windows, were evaluated for inclusion in our GA algorithm. Our GA algorithm was developed against a validated delivery-based GA algorithm in live-births, implemented within a sample of chart-confirmed stillbirths, and compared to national estimates of GA at stillbirth. Our algorithm, including fertility procedures and 11 prenatal tests, assigned a GA at delivery to 97.9% of live-births and 92.6% of stillbirths. For live-births (n = 4 701 207), it estimated GA within 2 weeks of a reference delivery-based GA algorithm in 82.5% of pregnancies, with a mean difference of 3.7 days. In chart-confirmed stillbirths (n = 49), it estimated GA within 2 weeks of the clinically recorded GA at delivery for 80% of pregnancies, with a mean difference of 11.1 days. Implementation of the algorithm in a cohort of stillbirths (n = 40 484) had an increased percentage of deliveries after 36 weeks compared to national estimates. In a population of primarily commercially-insured pregnant women, fertility procedures and prenatal tests can estimate GA with sufficient sensitivity and accuracy for utility in pregnancy studies.
Published: 2022

37. Preoperative Depression Status and Five Year Metabolic and Bariatric Surgery Outcomes in the PCORnet Bariatric Study Cohort

Author: Janelle W. Coughlin, Elizabeth Nauman, Robert Wellman, R. Yates Coley, Kathleen M. McTigue, Karen J. Coleman, Daniel B. Jones, Kristina Lewis, Jonathan N. Tobin, Christina C. Wee, Stephanie L. Fitzpatrick, Jay R. Desai, Sameer Murali, Ellen H. Morrow, Ann M. Rogers, G. Craig Wood, David G. Schlundt, Caroline M. Apovian, Meredith C. Duke, James C. McClay, Rohit Soans, Rabih Nemr, Neely Williams, Anita Courcoulas, John H. Holmes, Jane Anau, Sengwee Toh, Jessica L. Sturtevant, Casie E. Horgan, Andrea J. Cook, and David E. Arterburn
Abstract: To examine whether depression status before metabolic and bariatric surgery (MBS) influenced 5-year weight loss, diabetes, and safety/utilization outcomes in the PCORnet Bariatric Study. Research on the impact of depression on MBS outcomes is inconsistent with few large, long-term studies. Data were extracted from 23 health systems on 36,871 patients who underwent sleeve gastrectomy (SG; n=16,158) or gastric bypass (RYGB; n=20,713) from 2005-2015. Patients with and without a depression diagnosis in the year prior to MBS were evaluated for % total weight loss (%TWL), diabetes outcomes, and postsurgical safety/utilization (reoperations, revisions, endoscopy, hospitalizations, mortality) at 1, 3 and 5 years after MBS. 27.1% of SG and 33.0% of RYGB patients had preoperative depression, and they had more medical and psychiatric comorbidities than those without depression. At 5 years of follow-up, those with depression, versus those without depression, had slightly less %TWL after RYGB, but not after SG (between group difference = 0.42%TWL, p=0.04). However, patients with depression had slightly larger HbA1c improvements after RYGB but not after SG (between group difference = -0.19, p=0.04). Baseline depression did not moderate diabetes remission or relapse, reoperations, revision, or mortality across operations; however, baseline depression did moderate the risk of endoscopy and repeat hospitalization across RYGB versus SG. Patients with depression undergoing RYGB and SG had similar weight loss, diabetes, and safety/utilization outcomes to those without depression. The effects of depression were clinically small compared to the choice of operation.
Published: 2022

38. Reproducing Protocol‐Based Studies Using Parameterizable Tools—Comparison of Analytic Approaches Used by Two Medical Product Surveillance Networks

Author: Robert W. Platt, Richard Platt, Mayura Shinde, Sengwee Toh, Emily C. Welch, Laurent Azoulay, Judith C. Maro, Ting-Ying Huang, and Kristian B. Filion
Subjects: Adult, Male, Canada, Adolescent, Computer science, Machine learning, computer.software_genre, Incretins, 030226 pharmacology & pharmacy, Cohort Studies, Young Adult, 03 medical and health sciences, 0302 clinical medicine, Product Surveillance, Postmarketing, Humans, Hypoglycemic Agents, Pharmacology (medical), Drug effect, Aged, Retrospective Studies, Heart Failure, Pharmacology, Protocol (science), Safety surveillance, business.industry, Hazard ratio, Middle Aged, United States, Replication (computing), 3. Good health, Pancreatitis, Medical product, 030220 oncology & carcinogenesis, Cohort, Female, Observational study, Artificial intelligence, business, computer, Follow-Up Studies
Abstract: The US Sentinel System and the Canadian Network for Observational Drug Effect Studies (CNODES) are two medical product safety surveillance networks. Using Sentinel's preprogrammed, parameterizable analytic tools, we reproduced two protocol-based studies conducted by CNODES to assess the risks of acute pancreatitis and heart failure (HF) associated with the use of incretin-based drugs, compared with use of ≥ 2 oral hypoglycemic agents. Results from the replication new-user cohort analyses aligned with those from the CNODES nested case-control studies. The adjusted hazard ratios were 0.95 (0.81-1.12; vs. 1.03 (0.87-1.22) in CNODES) for acute pancreatitis and 0.91 (0.84-1.00; vs. 0.82 (0.67-1.00) in CNODES) for HF among patients without HF history. The CNODES's common protocol approach allows studies tailored to specific safety questions, whereas the Sentinel's common data model plus pretested program approach enables more rapid analysis. Despite these differences, it is possible to obtain comparable results using both approaches.
Published: 2019

39. Privacy-protecting estimation of adjusted risk ratios using modified Poisson regression in multi-center studies

Author: Jessica G. Young, Di Shu, and Sengwee Toh
Subjects: Distributed analysis, Epidemiology, Computer science, Pooling, Health Informatics, Logistic regression, Multi-center studies, 01 natural sciences, 010104 statistics & probability, 03 medical and health sciences, symbols.namesake, 0302 clinical medicine, Statistics, Humans, Multicenter Studies as Topic, Privacy protection, Poisson Distribution, 030212 general & internal medicine, Poisson regression, 0101 mathematics, lcsh:R5-920, Models, Statistical, Risk ratio, Modified Poisson regression, Odds ratio, Confidence interval, Standard error, Distributed algorithm, Relative risk, symbols, Regression Analysis, Risk Adjustment, lcsh:Medicine (General), Algorithms, Confidentiality, Research Article
Abstract: Background Multi-center studies can generate robust and generalizable evidence, but privacy considerations and legal restrictions often make it challenging or impossible to pool individual-level data across data-contributing sites. With binary outcomes, privacy-protecting distributed algorithms to conduct logistic regression analyses have been developed. However, the risk ratio often provides a more transparent interpretation of the exposure-outcome association than the odds ratio. Modified Poisson regression has been proposed to directly estimate adjusted risk ratios and produce confidence intervals with the correct nominal coverage when individual-level data are available. There are currently no distributed regression algorithms to estimate adjusted risk ratios while avoiding pooling of individual-level data in multi-center studies. Methods By leveraging the Newton-Raphson procedure, we adapted the modified Poisson regression method to estimate multivariable-adjusted risk ratios using only summary-level information in multi-center studies. We developed and tested the proposed method using both simulated and real-world data examples. We compared its results with the results from the corresponding pooled individual-level data analysis. Results Our proposed method produced the same adjusted risk ratio estimates and standard errors as the corresponding pooled individual-level data analysis without pooling individual-level data across data-contributing sites. Conclusions We developed and validated a distributed modified Poisson regression algorithm for valid and privacy-protecting estimation of adjusted risk ratios and confidence intervals in multi-center studies. This method allows computation of a more interpretable measure of association for binary outcomes, along with valid construction of confidence intervals, without sharing of individual-level data.
Published: 2019

40. Can Observational Analyses of Routinely Collected Data Emulate Randomized Trials? Design and Feasibility of the Observational Patient Evidence for Regulatory Approval Science and Understanding Disease Project

Author: William Crown, Issa J. Dahabreh, Xiaojuan Li, Sengwee Toh, and Barbara Bierer
Subjects: Health Policy, Public Health, Environmental and Occupational Health
Abstract: The Observational Patient Evidence for Regulatory Approval Science and Understanding Disease (OPERAND) project examines whether real-world data (RWD) can be used to inform regulatory decision making. OPERAND evaluates whether observational analyses using RWD to emulate index trials can produce effect estimates similar to those of the trials and examines the impact of relaxing the eligibility criteria of the observational analyses to obtain samples that more closely match the real-world populations receiving the treatments. In OPERAND, 2 research teams independently attempt to emulate the ROCKET Atrial Fibrillation and LEAD-2 trials using OptumLabs data. This article describes the design of the project, summarizes the approaches of the 2 research teams, and presents feasibility results for 2 emulations using new-user designs. There were differences in the teams' conceptualizations of the emulation, design decisions for cohort identification, and resulting RWD cohorts. These differences occurred even though both teams were guided by the same index trials and had access to the same source of RWD. Reasonable alternative design and analysis approaches may be taken to answer the same research question, even when attempting to emulate the same index trial. Researcher decision making is an understudied and potentially important source of variability across RWD analyses.
Published: 2021

41. Real-world Use of and Spending on New Oral Targeted Cancer Drugs in the US, 2011-2018

Author: Austin Cosgrove, Xiaodong Guan, Huseyin Naci, Mengyuan Fu, Dennis Ross-Degnan, Anita K. Wagner, Sengwee Toh, Ziyue Xu, Christopher M. Booth, and Bishal Gyawali
Subjects: Drug, Male, medicine.medical_specialty, media_common.quotation_subject, Cancer drugs, Administration, Oral, Antineoplastic Agents, Drug Costs, law.invention, Food and drug administration, Randomized controlled trial, law, Neoplasms, Internal Medicine, Overall survival, Medicine, Humans, Drug Approval, media_common, Retrospective Studies, Original Investigation, business.industry, Cancer, Middle Aged, medicine.disease, Quality of evidence, Cross-Sectional Studies, Emergency medicine, Female, business
Abstract: Importance Launch prices of new cancer drugs in the US have substantially increased in recent years despite growing concerns about the quantity and quality of evidence supporting their approval by the US Food and Drug Administration (FDA). Objective To assess the use of and spending on new oral targeted cancer drugs among US residents with employer-sponsored insurance between 2011 and 2018, stratified by the strength of available evidence of benefit. Design, Setting, and Participants In this cross-sectional study, dispensing claims for oral targeted cancer drugs first approved by the FDA between January 1, 2011, and December 31, 2018, were analyzed. The number of patients with drugs dispensed and the total payment for all claims were aggregated by calendar year, and these outcomes were arrayed according to evidence underlying FDA approvals, including pivotal study design (availability of randomized clinical trials) and overall survival (OS) benefit, as documented in drug labels. This study was conducted from July 17, 2019, to July 23, 2021. Main Outcomes and Measures Annual and cumulative numbers of patients who had dispensing events, and annual and cumulative sums of payment for eligible drugs. Results Of 37 348 patients who had at least 1 of the 44 new oral targeted drugs dispensed between 2011 and 2018, 21 324 were men (57.1%); mean (SD) age was 64.1 (13.1) years. Most individuals (36 246 [97.0%]) received drugs for which evidence from randomized clinical trials existed; however, a growing share of patients received drugs without documented OS benefit during the study period: from 12.7% in 2011 to 58.8% in 2018. Cumulative spending on all sample drugs totaled $3.5 billion by the end of 2018, of which 96.8% was spent on drugs that were approved based on a pivotal randomized clinical trial. Cumulative spending on drugs without documented OS benefit ($1.8 billion [51.6%]) surpassed that on drugs with documented OS benefit ($1.7 billion [48.4%]) by the end of 2018. Conclusions and Relevance The findings of this cross-sectional study suggest that drugs used for treatment of cancer without documented OS benefits are adopted in the health system and account for substantial spending.
Published: 2021

42. Comparative Safety and Effectiveness of Roux-en-Y Gastric Bypass and Sleeve Gastrectomy for Weight Loss and Type 2 Diabetes Across Race and Ethnicity in the PCORnet Bariatric Study Cohort

Author: Karen J. Coleman, Robert Wellman, Stephanie L. Fitzpatrick, Molly B. Conroy, Callie Hlavin, Kristina H. Lewis, R. Yates Coley, Kathleen M. McTigue, Jonathan N. Tobin, Corrigan L. McBride, Jay R. Desai, Jeanne M. Clark, Sengwee Toh, Jessica L. Sturtevant, Casie E. Horgan, Meredith C. Duke, Neely Williams, Jane Anau, Michael A. Horberg, Marc P. Michalsky, Andrea J. Cook, David E. Arterburn, Caroline M. Apovian, and William S. Richardson
Subjects: Adult, Glycated Hemoglobin, Adolescent, Gastric Bypass, Bariatric Surgery, Middle Aged, Obesity, Morbid, Cohort Studies, Treatment Outcome, Diabetes Mellitus, Type 2, Gastrectomy, Recurrence, Ethnic and Racial Minorities, Weight Loss, Ethnicity, Humans, Female, Surgery, Retrospective Studies
Abstract: Importance Bariatric surgery is the most effective treatment for severe obesity; yet it is unclear whether the long-term safety and comparative effectiveness of these operations differ across racial and ethnic groups. Objective To compare outcomes of Roux-en-Y gastric bypass (RYGB) and sleeve gastrectomy (SG) across racial and ethnic groups in the National Patient-Centered Clinical Research Network (PCORnet) Bariatric Study. Design, Setting, and Participants This was a retrospective, observational, comparative effectiveness cohort study that comprised 25 health care systems in the PCORnet Bariatric Study. Patients were adults and adolescents aged 12 to 79 years who underwent a primary (first nonrevisional) RYGB or SG operation between January 1, 2005, and September 30, 2015, at participating health systems. Patient race and ethnicity included Black, Hispanic, White, other, and unrecorded. Data were analyzed from July 1, 2021, to January 17, 2022. Exposure RYGB or SG. Outcomes Percentage total weight loss (%TWL); type 2 diabetes remission, relapse, and change in hemoglobin A1c (HbA1c) level; and postsurgical safety and utilization outcomes (operations, interventions, revisions/conversions, endoscopy, hospitalizations, mortality, 30-day major adverse events) at 1, 3, and 5 years after surgery. Results A total of 36 871 patients (mean [SE] age, 45.0 [11.7] years; 29 746 female patients [81%]) were included in the weight analysis. Patients identified with the following race and ethnic categories: 6891 Black (19%), 8756 Hispanic (24%), 19 645 White (53%), 826 other (2%), and 783 unrecorded (2%). Weight loss and mean reductions in HbA1c level were larger for RYGB than SG in all years for Black, Hispanic, and White patients (difference in 5-year weight loss: Black, −7.6%; 95% CI, −8.0 to −7.1; P < .001; Hispanic, −6.2%; 95% CI, −6.6 to −5.9; P < .001; White, −5.9%; 95% CI, −6.3 to −5.7; P < .001; difference in change in year 5 HbA1c level: Black, −0.29; 95% CI, −0.51 to −0.08; P = .009; Hispanic, −0.45; 95% CI, −0.61 to −0.29; P < .001; and White, −0.25; 95% CI, −0.40 to −0.11; P = .001). The magnitude of these differences was small among racial and ethnic groups (1%-3% of %TWL). Black and Hispanic patients had higher risk of hospitalization when they had RYGB compared with SG (hazard ratio [HR], 1.45; 95% CI, 1.17-1.79; P = .001 and 1.48; 95% CI, 1.22-1.79; P < .001, respectively). Hispanic patients had greater risk of all-cause mortality (HR, 2.41; 95% CI, 1.24-4.70; P = .01) and higher odds of a 30-day major adverse event (odds ratio, 1.92; 95% CI, 1.38-2.68; P < .001) for RYGB compared with SG. There was no interaction between race and ethnicity and operation type for diabetes remission and relapse. Conclusions and Relevance Variability of the comparative effectiveness of operations for %TWL and HbA1c level across race and ethnicity was clinically small; however, differences in safety and utilization outcomes were clinically and statistically significant for Black and Hispanic patients who had RYGB compared with SG. These findings can inform shared decision-making regarding bariatric operation choice for different racial and ethnic groups of patients.
Published: 2022

43. Statin use is associated with lower risk of dementia in stroke patients: a community-based cohort study with inverse probability weighted marginal structural model analysis

Author: Zhirong Yang, Sengwee Toh, Xiaojuan Li, Duncan Edwards, Carol Brayne, and Jonathan Mant
Subjects: Cohort Studies, Models, Structural, Stroke, Peptic Ulcer, Epidemiology, Statins, Humans, Dementia, cardiovascular diseases, Hydroxymethylglutaryl-CoA Reductase Inhibitors, Cohort study, Probability, Retrospective Studies
Abstract: Current evidence is inconclusive on cognitive benefits or harms of statins among stroke patients, who have high risk of dementia. This observational cohort study investigated the association between statin use and post-stroke dementia using data from the Clinical Practice Research Datalink. Patients without prior dementia who had an incident stroke but received no statins in the preceding year were followed for up to 10 years. We used inverse probability weighted marginal structural models to estimate observational analogues of intention-to-treat (ITT, statin initiation versus no initiation) and per-protocol (PP, sustained statin use versus no use) effects on the risk of dementia. To explore potential impact of unmeasured confounding, we examined the risks of coronary heart disease (CHD, positive control outcome), fracture and peptic ulcer (negative control outcomes). In 18,577 statin initiators and 14,613 non-initiators (mean follow-up of 4.2 years), the adjusted hazard ratio (aHR) for dementia was 0.70 (95% confidence interval [CI] 0.64-0.75) in ITT analysis and 0.55 (95% CI 0.50-0.62) in PP analysis. The corresponding ITT and PP aHRs were 0.87 (95% CI 0.79-0.95) and 0.70 (95% CI 0.62-0.80) for CHD, 1.03 (95% CI 0.82-1.29) and 1.09 (95% CI 0.77-1.54) for peptic ulcer, and 0.88 (95% CI 0.80-0.96) and 0.86 (95% CI 0.75-0.98) for fracture. Statin initiation after stroke was associated with lower risk of dementia, with a potentially greater benefit in patients who persisted with statins over time. The observed association of statin use with post-stroke dementia may in part be overestimated due to unmeasured confounding shared with the association between statin use and fracture.
Published: 2021

44. Who gets treated for influenza: A surveillance study from the US Food and Drug Administration's Sentinel System

Author: Robert Orr, Catherine A. Panozzo, Greg Measer, Nicole Haug, Henry Francis, Sarah K. Dutcher, Alfred Sorbello, Noelle M. Cocoros, Austin Cosgrove, Sengwee Toh, and Crystal Garcia
Subjects: Microbiology (medical), medicine.medical_specialty, Epidemiology, medicine.medical_treatment, Cilastatin, Imipenem Drug Combination, Medicare, Antiviral Agents, Diabetes mellitus, Influenza, Human, medicine, Humans, Asthma, Aged, Retrospective Studies, Mechanical ventilation, business.industry, United States Food and Drug Administration, Retrospective cohort study, medicine.disease, Obesity, United States, Vaccination, Hospitalization, Oxygen, Infectious Diseases, Emergency medicine, Complication, business, Medicaid
Abstract: Objective: We describe the baseline characteristics and complications of individuals with influenza in the US FDA’s Sentinel System by antiviral treatment timing. Design: Retrospective cohort design. Patients: Individuals aged ≥6 months with outpatient diagnoses of influenza in June 2014–July 2017, 3 influenza seasons. Methods: We identified the comorbidities, vaccination history, influenza testing, and outpatient antiviral dispensings of individuals with influenza using administrative claims data from 13 data partners including the Centers for Medicare and Medicaid Services, integrated delivery systems, and commercial health plans. We assessed complications within 30 days: hospitalization, oxygen use, mechanical ventilation, critical care, ECMO, and death. Results: There were 1,090,333 influenza diagnoses in 2014–2015; 1,005,240 in 2016–2017; and 578,548 in 2017–2018. Between 49% and 55% of patients were dispensed outpatient treatment within 5 days. In all periods >80% of treated individuals received treatment on the day of diagnosis. Those treated on days 1–5 after diagnosis had higher prevalences of diabetes, chronic obstructive pulmonary disease, asthma, and obesity compared to those treated on the day of diagnosis or not treated at all. They also had higher rates of hospitalization, oxygen use, and critical care. In 2014–2015, among those aged ≥65 years, the rates of hospitalization were 45 per 1,000 diagnoses among those treated on day 0; 74 per 1,000 among those treated on days 1–5; and 50 per 1,000 among those who were untreated. Conclusions: In a large, national analysis, approximately half of people diagnosed with influenza in the outpatient setting were treated with antiviral medications. Delays in outpatient dispensed treatment were associated with higher prevalence of comorbidities and higher rates of complication.
Published: 2021

45. Effect of ICD-9-CM to ICD-10-CM coding system transition on identification of common conditions: An interrupted time series analysis

Author: Meng-Chen Hsu, Chi-Chuan Wang, Fang-Ju Lin, Sengwee Toh, Chih-Ying Lin, and Ling-Ya Huang
Subjects: Pediatrics, medicine.medical_specialty, Databases, Factual, Epidemiology, Anemia, Vascular disease, business.industry, valvular heart disease, Clinical Coding, ICD-10, Interrupted Time Series Analysis, medicine.disease, Liver disease, International Classification of Diseases, medicine, Prevalence, Humans, Pharmacology (medical), Segmented regression, business, Algorithms, Coding (social sciences)
Abstract: Purpose To evaluate the effect of diagnostic coding system transition on the identification of common conditions recorded in Taiwan's national claims database. Methods Using the National Health Insurance Research Database, we estimated the 3-month prevalence of recorded diagnosis of 32 conditions based on the ICD-9-CM codes in 2014-2015 and the ICD-10-CM codes in 2016-2017. Two algorithms were assessed for ICD-10-CM: validated ICD-10 codes in the literature and codes translated from ICD-9-CM using an established mapping algorithm. We used segmented regression analysis on time-series data to examine changes in the 3-month prevalence (both level and trend) before and after the ICD-10-CM implementation. Results Significant changes in the level were found in 19 and 11 conditions when using the ICD-10 codes from the literature and mapping algorithm, respectively. The conditions with inconsistent levels by both of the algorithms were valvular heart disease, peripheral vascular disease, mild liver disease, moderate to severe liver disease, metastatic cancer, rheumatoid arthritis and collagen vascular diseases, coagulopathy, blood loss anemia, deficiency anemia, alcohol abuse, and psychosis. Nine conditions had significant changes in the trend when using the ICD-10 codes from the literature or mapping algorithm. Conclusions Less than half of the 32 conditions studied had a smooth transition between the ICD-9-CM and ICD-10-CM coding systems. Researchers should pay attention to the conditions where the coding definitions result in inconsistent time series estimates.
Published: 2021

46. Distributed Networks of Databases Analyzed Using Common Protocols and/or Common Data Models

Author: Joshua J. Gagne, Sengwee Toh, Nicole L. Pratt, Olaf H. Klungel, and Robert W. Platt
Subjects: business.industry, Meta-analysis, Propensity score matching, Medicine, Data mining, business, computer.software_genre, computer, Data modeling
Published: 2019

47. Privacy-protecting multivariable-adjusted distributed regression analysis for multi-center pediatric study

Author: Simon Lin, Micah Prochaska, Jonathan Finkelstein, F. Sessions Cole, Sengwee Toh, Brad Appelhans, Douglas Lunsford, Zachary Willis, Dominick Lemas, Allison O'Neill, Marc B. Rosenman, Stephanie L. Fitzpatrick, Pi-I D. Lin, Elyse O. Kharbanda, Maria Rayas, William Heerman, David Arterburn, Jessica L. Sturtevant, Daksha Ranade, Goutham Rao, Ritu Khare, Juliane S. Reynolds, Mary Jo Messito, H. Timothy Bunnell, Carmen R. Isasi, Matthew F. Daley, Janne Boone-Heinenon, Melanie Jay, Holly L. Peay, Amanda F. Dempsey, Jason P. Block, Erick Moyneur, Christopher B. Forrest, Jessica G. Young, Bradley K. Taylor, L. Charles Bailey, Sheryl L. Rifas-Shiman, Casie Horgan, Andrew L. Brickman, and Michael A. Horberg
Subjects: Male, Pediatric Obesity, Databases, Factual, Computer science, Pooling, Article, Body Mass Index, Health data, 03 medical and health sciences, 0302 clinical medicine, 030225 pediatrics, Linear regression, Statistics, Electronic Health Records, Humans, Child, Computer Security, Information Dissemination, Multivariable calculus, Age Factors, Infant, Newborn, Infant, Regression analysis, Learning Health System, Regression, Anti-Bacterial Agents, Standard error, Privacy, Child, Preschool, Multivariate Analysis, Pediatrics, Perinatology and Child Health, Linear Models, Regression Analysis, Feasibility Studies, Female, Body mass index, Confidentiality, 030217 neurology & neurosurgery
Abstract: BACKGROUND Privacy-protecting analytic approaches without centralized pooling of individual-level data, such as distributed regression, are particularly important for vulnerable populations, such as children, but these methods have not yet been tested in multi-center pediatric studies. METHODS Using the electronic health data from 34 healthcare institutions in the National Patient-Centered Clinical Research Network (PCORnet), we fit 12 multivariable-adjusted linear regression models to assess the associations of antibiotic use
Published: 2019

48. Inverse probability weighted Cox model in multi-site studies without sharing individual-level data

Author: Di Shu, Bruce Fireman, Sengwee Toh, and Kazuki Yoshida
Subjects: Data Analysis, Statistics and Probability, Epidemiology, Proportional hazards model, Computer science, Inverse probability weighting, Pooling, Hazard ratio, Inference, Variance (accounting), 01 natural sciences, Article, 010104 statistics & probability, 03 medical and health sciences, 0302 clinical medicine, Inverse probability, Health Information Management, Research Design, Statistics, Table (database), 030212 general & internal medicine, 0101 mathematics, Probability, Proportional Hazards Models
Abstract: The inverse probability weighted Cox proportional hazards model can be used to estimate the marginal hazard ratio. In multi-site studies, it may be infeasible to pool individual-level datasets due to privacy and other considerations. We propose three methods for making inference on hazard ratios without the need for pooling individual-level datasets across sites. The first method requires a summary-level eight-column risk-set table to produce the same hazard ratio estimate and robust sandwich variance estimate as those from the corresponding pooled individual-level data analysis (reference analysis). The second and third methods, which are based on two bootstrap re-sampling strategies, require a summary-level four-column risk-set table and bootstrap-based risk-set tables from each site to produce the same hazard ratio and bootstrap variance estimates as those from their reference analyses. All three methods require only one file transfer between the data-contributing sites and the analysis center. We justify these methods theoretically, illustrate their use, and demonstrate their statistical performance using both simulated and real-world data.
Published: 2019

49. Evaluating the use of bootstrapping in cohort studies conducted with 1:1 propensity score matching—A plasmode simulation study

Author: Rishi J. Desai, Younathan Abdia, Hana Lee, Joshua J. Gagne, Shirley V. Wang, Michael Nguyen, Richard Wyss, Jessica M. Franklin, Sengwee Toh, Margaret Johnson, Jacqueline M. Major, and Sara Karami
Subjects: Matching (statistics), Percentile, Epidemiology, Administration, Oral, 030226 pharmacology & pharmacy, Cohort Studies, 03 medical and health sciences, 0302 clinical medicine, Resampling, Atrial Fibrillation, Outcome Assessment, Health Care, Statistics, Humans, Medicine, Computer Simulation, Pharmacology (medical), 030212 general & internal medicine, Propensity Score, Proportional Hazards Models, Parametric statistics, business.industry, Anticoagulants, Confidence interval, Standard error, Bootstrapping (electronics), Research Design, Data Interpretation, Statistical, Propensity score matching, business, Monte Carlo Method
Abstract: PURPOSE Bootstrapping can account for uncertainty in propensity score (PS) estimation and matching processes in 1:1 PS-matched cohort studies. While theory suggests that the classical bootstrap can fail to produce proper coverage, the practical impact of this theoretical limitation in settings typical to pharmacoepidemiology is not well studied. METHODS In a plasmode-based simulation study, we compared the performance of the standard parametric approach, which ignores uncertainty in PS estimation and matching, with two bootstrapping methods. The first method only accounted for uncertainty introduced during the matching process (the observation resampling approach). The second method accounted for uncertainty introduced during both PS estimation and matching processes (the PS reestimation approach). Variance was estimated based on percentile and empirical standard errors, and treatment effect estimation was based on median and mean of the estimated treatment effects across 1000 bootstrap resamples. Two treatment prevalence scenarios (5% and 29%) across two treatment effect scenarios (hazard ratio of 1.0 and 2.0) were evaluated in 500 simulated cohorts of 10 000 patients each. RESULTS We observed that 95% confidence intervals from the bootstrapping approaches, but not the standard approach, resulted in inaccurate coverage rates (98%-100% for the observation resampling approach, 99%-100% for the PS reestimation approach, and 95%-96% for the standard approach). Treatment effect estimation based on bootstrapping approaches resulted in lower bias than the standard approach (less than 1.4% vs 4.1%) at 5% treatment prevalence; however, the performance was equivalent at 29% treatment prevalence. CONCLUSION Use of bootstrapping led to variance overestimation and inconsistent coverage, while coverage remained more consistent with parametric estimation.
Published: 2019

50. Pregnancy-Associated Cancer: A U.S. Population-Based Study

Author: Inna Dashevsky, De-Kun Li, Pamala A. Pawloski, Ann H. Partridge, Larissa Nekhlyudov, Sengwee Toh, Marsha A. Raebel, Susan E. Andrade, Debra P. Ritzwoller, and Carrie M. Cottreau
Subjects: Adult, medicine.medical_specialty, Adolescent, Genital Neoplasms, Female, Breast Neoplasms, Cohort Studies, Young Adult, 03 medical and health sciences, 0302 clinical medicine, Pregnancy, Neoplasms, medicine, Humans, Registries, Thyroid Neoplasms, 030212 general & internal medicine, Child, Melanoma, business.industry, Obstetrics, Incidence, Incidence (epidemiology), Cancer, General Medicine, Middle Aged, medicine.disease, United States, Hematologic Neoplasms, 030220 oncology & carcinogenesis, Special Section: Cancer in Women, Female, business, Pregnancy Complications, Neoplastic, U.S. population
Abstract: Background: The incidence of pregnancy-associated cancer (PAC) is expected to increase as more women delay childbearing until later ages. However, information on frequency and incidence of PAC is scarce in the United States. Methods: We identified pregnancies among women aged 10–54 years during 2001–2013 from five U.S. health plans participating in the Cancer Research Network (CRN) and the Medication Exposure in Pregnancy Risk Evaluation Program (MEPREP). We extracted information from the health plans' administrative claims and electronic health record databases, tumor registries, and infants' birth certificate files to estimate the frequency and incidence of PAC, defined as cancer diagnosed during pregnancy and up to 1 year postpartum. Results: We identified 846 PAC events among 775,709 pregnancies from 2001 to 2013. The overall incidence estimate was 109.1 (95% confidence interval [CI] = 101.8–116.7) per 100,000 pregnancies. There was an increase in the incidence between 2002 and 2012 (the period during which complete data were available), from 75.0 (95% CI = 54.9–100.0) per 100,000 pregnancies in 2002 to 138.5 (95% CI = 109.1–173.3) per 100,000 pregnancies in 2012. The most common invasive cancers diagnosed were breast (n = 208, 24.6%), thyroid (n = 168, 19.9%), melanoma (n = 93, 11.0%), hematologic (n = 87, 10.3%), and cervix/uterus (n = 74, 8.7%). Conclusions: Our study provides contemporary incidence estimates of PAC from a population-based cohort of U.S. women. These estimates provide the data needed to help develop clinical and public health policies aimed at diagnosing PAC at an early stage and initiating appropriate therapeutic interventions in a timely manner.
Published: 2019
