Author: "van Smeden, Maarten" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"van Smeden, Maarten"' showing total 818 results

Start Over Author "van Smeden, Maarten"

818 results on '"van Smeden, Maarten"'

1. Performance evaluation of predictive AI models to support medical decisions: Overview and guidance

Author: Van Calster, Ben, Collins, Gary S., Vickers, Andrew J., Wynants, Laure, Kerr, Kathleen F., Barreñada, Lasai, Varoquaux, Gael, Singh, Karandeep, Moons, Karel G. M., Hernandez-boussard, Tina, Timmerman, Dirk, Mclernon, David J., Van Smeden, Maarten, and Steyerberg, Ewout W.
Subjects: Computer Science - Machine Learning, Statistics - Methodology, Statistics - Machine Learning
Abstract: A myriad of measures to illustrate performance of predictive artificial intelligence (AI) models have been proposed in the literature. Selecting appropriate performance measures is essential for predictive AI models that are developed to be used in medical practice, because poorly performing models may harm patients and lead to increased costs. We aim to assess the merits of classic and contemporary performance measures when validating predictive AI models for use in medical practice. We focus on models with a binary outcome. We discuss 32 performance measures covering five performance domains (discrimination, calibration, overall, classification, and clinical utility) along with accompanying graphical assessments. The first four domains cover statistical performance, the fifth domain covers decision-analytic performance. We explain why two key characteristics are important when selecting which performance measures to assess: (1) whether the measure's expected value is optimized when it is calculated using the correct probabilities (i.e., a "proper" measure), and (2) whether they reflect either purely statistical performance or decision-analytic performance by properly considering misclassification costs. Seventeen measures exhibit both characteristics, fourteen measures exhibited one characteristic, and one measure possessed neither characteristic (the F1 measure). All classification measures (such as classification accuracy and F1) are improper for clinically relevant decision thresholds other than 0.5 or the prevalence. We recommend the following measures and plots as essential to report: AUROC, calibration plot, a clinical utility measure such as net benefit with decision curve analysis, and a plot with probability distributions per outcome category., Comment: 60 pages, 8 tables, 11 figures, two supplementary appendices
Published: 2024

2. Extended sample size calculations for evaluation of prediction models using a threshold for classification

Author: Whittle, Rebecca, Ensor, Joie, Archer, Lucinda, Collins, Gary S., Dhiman, Paula, Denniston, Alastair, Alderman, Joseph, Legha, Amardeep, van Smeden, Maarten, Moons, Karel G., Cazier, Jean-Baptiste, Riley, Richard D., and Snell, Kym I. E.
Subjects: Statistics - Methodology
Abstract: When evaluating the performance of a model for individualised risk prediction, the sample size needs to be large enough to precisely estimate the performance measures of interest. Current sample size guidance is based on precisely estimating calibration, discrimination, and net benefit, which should be the first stage of calculating the minimum required sample size. However, when a clinically important threshold is used for classification, other performance measures can also be used. We extend the previously published guidance to precisely estimate threshold-based performance measures. We have developed closed-form solutions to estimate the sample size required to target sufficiently precise estimates of accuracy, specificity, sensitivity, PPV, NPV, and F1-score in an external evaluation study of a prediction model with a binary outcome. This approach requires the user to pre-specify the target standard error and the expected value for each performance measure. We describe how the sample size formulae were derived and demonstrate their use in an example. Extension to time-to-event outcomes is also considered. In our examples, the minimum sample size required was lower than that required to precisely estimate the calibration slope, and we expect this would most often be the case. Our formulae, along with corresponding Python code and updated R and Stata commands (pmvalsampsize), enable researchers to calculate the minimum sample size needed to precisely estimate threshold-based performance measures in an external evaluation study. These criteria should be used alongside previously published criteria to precisely estimate the calibration, discrimination, and net-benefit., Comment: 27 pages, 1 figure
Published: 2024

3. The harms of class imbalance corrections for machine learning based prediction models: a simulation study

Author: Carriero, Alex, Luijken, Kim, de Hond, Anne, Moons, Karel GM, van Calster, Ben, and van Smeden, Maarten
Subjects: Statistics - Methodology
Abstract: Risk prediction models are increasingly used in healthcare to aid in clinical decision making. In most clinical contexts, model calibration (i.e., assessing the reliability of risk estimates) is critical. Data available for model development are often not perfectly balanced with respect to the modeled outcome (i.e., individuals with vs. without the event of interest are not equally represented in the data). It is common for researchers to correct this class imbalance, yet, the effect of such imbalance corrections on the calibration of machine learning models is largely unknown. We studied the effect of imbalance corrections on model calibration for a variety of machine learning algorithms. Using extensive Monte Carlo simulations we compared the out-of-sample predictive performance of models developed with an imbalance correction to those developed without a correction for class imbalance across different data-generating scenarios (varying sample size, the number of predictors and event fraction). Our findings were illustrated in a case study using MIMIC-III data. In all simulation scenarios, prediction models developed without a correction for class imbalance consistently had equal or better calibration performance than prediction models developed with a correction for class imbalance. The miscalibration introduced by correcting for class imbalance was characterized by an over-estimation of risk and was not always able to be corrected with re-calibration. Correcting for class imbalance is not always necessary and may even be harmful for clinical prediction models which aim to produce reliable risk estimates on an individual basis.
Published: 2024

4. Safety of treating acute pulmonary embolism at home: an individual patient data meta-analysis.

Author: Luijten, Dieuwke, Douillet, Delphine, Luijken, Kim, Tromeur, Cecile, Penaloza, Andrea, Hugli, Olivier, Aujesky, Drahomir, Barco, Stefano, Bledsoe, Joseph, Chang, Kyle, Couturaud, Francis, den Exter, Paul, Font, Carme, Huisman, Menno, Jimenez, David, Kabrhel, Christopher, Kline, Jeffrey, Konstantinides, Stavros, van Mens, Thijs, Otero, Remedios, Peacock, W, Sanchez, Olivier, Stubblefield, William, Valerio, Luca, Vinson, David, Wells, Philip, van Smeden, Maarten, Roy, Pierre-Marie, and Klok, Frederikus
Subjects: Clinical decision-making, Early discharge, Emergency care, Outpatient care, Pulmonary embolism, Humans, Pulmonary Embolism, Acute Disease, Home Care Services, Hemorrhage, Male, Female, Anticoagulants, Randomized Controlled Trials as Topic, Prospective Studies, Aged, Natriuretic Peptide, Brain, Middle Aged
Abstract: BACKGROUND AND AIMS: Home treatment is considered safe in acute pulmonary embolism (PE) patients selected by a validated triage tool (e.g. simplified PE severity index score or Hestia rule), but there is uncertainty regarding the applicability in underrepresented subgroups. The aim was to evaluate the safety of home treatment by performing an individual patient-level data meta-analysis. METHODS: Ten prospective cohort studies or randomized controlled trials were identified in a systematic search, totalling 2694 PE patients treated at home (discharged within 24 h) and identified by a predefined triage tool. The 14- and 30-day incidences of all-cause mortality and adverse events (combined endpoint of recurrent venous thromboembolism, major bleeding, and/or all-cause mortality) were evaluated. The relative risk (RR) for 14- and 30-day mortalities and adverse events is calculated in subgroups using a random effects model. RESULTS: The 14- and 30-day mortalities were 0.11% [95% confidence interval (CI) 0.0-0.24, I2 = 0) and 0.30% (95% CI 0.09-0.51, I2 = 0). The 14- and 30-day incidences of adverse events were 0.56% (95% CI 0.28-0.84, I2 = 0) and 1.2% (95% CI 0.79-1.6, I2 = 0). Cancer was associated with increased 30-day mortality [RR 4.9; 95% prediction interval (PI) 2.7-9.1; I2 = 0]. Pre-existing cardiopulmonary disease, abnormal troponin, and abnormal (N-terminal pro-)B-type natriuretic peptide [(NT-pro)BNP] at presentation were associated with an increased incidence of 14-day adverse events [RR 3.5 (95% PI 1.5-7.9, I2 = 0), 2.5 (95% PI 1.3-4.9, I2 = 0), and 3.9 (95% PI 1.6-9.8, I2 = 0), respectively], but not mortality. At 30 days, cancer, abnormal troponin, and abnormal (NT-pro)BNP were associated with an increased incidence of adverse events [RR 2.7 (95% PI 1.4-5.2, I2 = 0), 2.9 (95% PI 1.5-5.7, I2 = 0), and 3.3 (95% PI 1.6-7.1, I2 = 0), respectively]. CONCLUSIONS: The incidence of adverse events in home-treated PE patients, selected by a validated triage tool, was very low. Patients with cancer had a three- to five-fold higher incidence of adverse events and death. Patients with increased troponin or (NT-pro)BNP had a three-fold higher risk of adverse events, driven by recurrent venous thromboembolism and bleeding.
Published: 2024

5. Validation of prognostic models predicting mortality or ICU admission in patients with COVID-19 in low- and middle-income countries: a global individual participant data meta-analysis

Author: Damen, Johanna A. A., Arshi, Banafsheh, van Smeden, Maarten, Bertagnolio, Silvia, Diaz, Janet V., Silva, Ronaldo, Thwin, Soe Soe, Wynants, Laure, and Moons, Karel G. M.
Published: 2024
Full Text: View/download PDF

6. Accuracy of urgency allocation in patients with shortness of breath calling out-of-hours primary care: a cross-sectional study

Author: Spek, Michelle, Venekamp, Roderick P., de Groot, Esther, Geersing, Geert-Jan, Erkelens, Daphne C. A., van Smeden, Maarten, Dobbe, Anna S. M., Delissen, Mathé, Rutten, Frans H., and Zwart, Dorien L.
Published: 2024
Full Text: View/download PDF

7. Limited incremental predictive value of the frailty index and other vulnerability measures from routine care data for mortality risk prediction in older patients with COVID-19 in primary care

Author: la Roi-Teeuw, Hannah M., Luijken, Kim, Blom, Marieke T., Gussekloo, Jacobijn, Mooijaart, Simon P., Polinder-Bos, Harmke A., van Smeden, Maarten, Geersing, Geert-Jan, and van den Dries, Carline J.
Published: 2024
Full Text: View/download PDF

8. Understanding metric-related pitfalls in image analysis validation

Author: Reinke, Annika, Tizabi, Minu D., Baumgartner, Michael, Eisenmann, Matthias, Heckmann-Nötzel, Doreen, Kavur, A. Emre, Rädsch, Tim, Sudre, Carole H., Acion, Laura, Antonelli, Michela, Arbel, Tal, Bakas, Spyridon, Benis, Arriel, Blaschko, Matthew, Buettner, Florian, Cardoso, M. Jorge, Cheplygina, Veronika, Chen, Jianxu, Christodoulou, Evangelia, Cimini, Beth A., Collins, Gary S., Farahani, Keyvan, Ferrer, Luciana, Galdran, Adrian, van Ginneken, Bram, Glocker, Ben, Godau, Patrick, Haase, Robert, Hashimoto, Daniel A., Hoffman, Michael M., Huisman, Merel, Isensee, Fabian, Jannin, Pierre, Kahn, Charles E., Kainmueller, Dagmar, Kainz, Bernhard, Karargyris, Alexandros, Karthikesalingam, Alan, Kenngott, Hannes, Kleesiek, Jens, Kofler, Florian, Kooi, Thijs, Kopp-Schneider, Annette, Kozubek, Michal, Kreshuk, Anna, Kurc, Tahsin, Landman, Bennett A., Litjens, Geert, Madani, Amin, Maier-Hein, Klaus, Martel, Anne L., Mattson, Peter, Meijering, Erik, Menze, Bjoern, Moons, Karel G. M., Müller, Henning, Nichyporuk, Brennan, Nickel, Felix, Petersen, Jens, Rafelski, Susanne M., Rajpoot, Nasir, Reyes, Mauricio, Riegler, Michael A., Rieke, Nicola, Saez-Rodriguez, Julio, Sánchez, Clara I., Shetty, Shravya, van Smeden, Maarten, Summers, Ronald M., Taha, Abdel A., Tiulpin, Aleksei, Tsaftaris, Sotirios A., Van Calster, Ben, Varoquaux, Gaël, Wiesenfarth, Manuel, Yaniv, Ziv R., Jäger, Paul F., and Maier-Hein, Lena
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibility of metric-related knowledge: While taking into account the individual strengths, weaknesses, and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multi-stage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides the first reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Focusing on biomedical image analysis but with the potential of transfer to other fields, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. To facilitate comprehension, illustrations and specific examples accompany each pitfall. As a structured body of information accessible to researchers of all levels of expertise, this work enhances global comprehension of a key topic in image analysis validation., Comment: Shared first authors: Annika Reinke and Minu D. Tizabi; shared senior authors: Lena Maier-Hein and Paul F. J\"ager. Published in Nature Methods. arXiv admin note: text overlap with arXiv:2206.01653
Published: 2023
Full Text: View/download PDF

9. Cross-institution text mining to uncover clinical associations: a case study relating social factors and code status in intensive care medicine

Author: Sushil, Madhumita, Butte, Atul J., Schuit, Ewoud, van Smeden, Maarten, and Leeuwenberg, Artuur M.
Subjects: Computer Science - Computation and Language, Statistics - Methodology, 68T50, 68U35, 62-xx, 62P10, 92C60, 92D30, I.2.7, G.3
Abstract: Objective: Text mining of clinical notes embedded in electronic medical records is increasingly used to extract patient characteristics otherwise not or only partly available, to assess their association with relevant health outcomes. As manual data labeling needed to develop text mining models is resource intensive, we investigated whether off-the-shelf text mining models developed at external institutions, together with limited within-institution labeled data, could be used to reliably extract study variables to conduct association studies. Materials and Methods: We developed multiple text mining models on different combinations of within-institution and external-institution data to extract social factors from discharge reports of intensive care patients. Subsequently, we assessed the associations between social factors and having a do-not-resuscitate/intubate code. Results: Important differences were found between associations based on manually labeled data compared to text-mined social factors in three out of five cases. Adopting external-institution text mining models using manually labeled within-institution data resulted in models with higher F1-scores, but not in meaningfully different associations. Discussion: While text mining facilitated scaling analyses to larger samples leading to discovering a larger number of associations, the estimates may be unreliable. Confirmation is needed with better text mining models, ideally on a larger manually labeled dataset. Conclusion: The currently used text mining models were not sufficiently accurate to be used reliably in an association study. Model adaptation using within-institution data did not improve the estimates. Further research is needed to set conditions for reliable use of text mining in medical research.
Published: 2023

10. Updating methods for artificial intelligence–based clinical prediction models: a scoping review

Author: Meijerink, Lotta M., Dunias, Zoë S., Leeuwenberg, Artuur M., de Hond, Anne A.H., Jenkins, David A., Martin, Glen P., Sperrin, Matthew, Peek, Niels, Spijker, René, Hooft, Lotty, Moons, Karel G.M., van Smeden, Maarten, and Schuit, Ewoud
Published: 2025
Full Text: View/download PDF

11. Methodological quality assessment tools for diagnosis and prognosis research: overview and guidance

Author: Kaul, Tabea, Kellerhuis, Bas E., Damen, Johanna A.A., Schuit, Ewoud, Jenniskens, Kevin, van Smeden, Maarten, Reitsma, Johannes B., Hooft, Lotty, Moons, Karel G.M., and Yang, Bada
Published: 2025
Full Text: View/download PDF

12. Minimum Sample Size for Developing a Multivariable Prediction Model using Multinomial Logistic Regression

Author: Pate, Alexander, Riley, Richard D, Collins, Gary S, van Smeden, Maarten, Van Calster, Ben, Ensor, Joie, and Martin, Glen P
Subjects: Statistics - Methodology, Statistics - Applications
Abstract: Multinomial logistic regression models allow one to predict the risk of a categorical outcome with more than 2 categories. When developing such a model, researchers should ensure the number of participants (n) is appropriate relative to the number of events (E.k) and the number of predictor parameters (p.k) for each category k. We propose three criteria to determine the minimum n required in light of existing criteria developed for binary outcomes. The first criteria aims to minimise the model overfitting. The second aims to minimise the difference between the observed and adjusted R2 Nagelkerke. The third criterion aims to ensure the overall risk is estimated precisely. For criterion (i), we show the sample size must be based on the anticipated Cox-snell R2 of distinct one-to-one logistic regression models corresponding to the sub-models of the multinomial logistic regression, rather than on the overall Cox-snell R2 of the multinomial logistic regression. We tested the performance of the proposed criteria (i) through a simulation study, and found that it resulted in the desired level of overfitting. Criterion (ii) and (iii) are natural extensions from previously proposed criteria for binary outcomes. We illustrate how to implement the sample size criteria through a worked example considering the development of a multinomial risk prediction model for tumour type when presented with an ovarian mass. Code is provided for the simulation and worked example. We will embed our proposed criteria within the pmsampsize R library and Stata modules.
Published: 2022

13. Imputation and Missing Indicators for handling missing data in the development and implementation of clinical prediction models: a simulation study

Author: Sisk, Rose, Sperrin, Matthew, Peek, Niels, van Smeden, Maarten, and Martin, Glen P.
Subjects: Statistics - Methodology
Abstract: Background: Existing guidelines for handling missing data are generally not consistent with the goals of prediction modelling, where missing data can occur at any stage of the model pipeline. Multiple imputation (MI), often heralded as the gold standard approach, can be challenging to apply in the clinic. Clearly, the outcome cannot be used to impute data at prediction time. Regression imputation (RI) may offer a pragmatic alternative in the prediction context, that is simpler to apply in the clinic. Moreover, the use of missing indicators can handle informative missingness, but it is currently unknown how well they perform within CPMs. Methods: We performed a simulation study where data were generated under various missing data mechanisms to compare the predictive performance of CPMs developed using both imputation methods. We consider deployment scenarios where missing data is permitted/prohibited, and develop models that use/omit the outcome during imputation and include/omit missing indicators. Results: When complete data must be available at deployment, our findings were in line with widely used recommendations; that the outcome should be used to impute development data under MI, yet omitted under RI. When imputation is applied at deployment, omitting the outcome from the imputation at development was preferred. Missing indicators improved model performance in some specific cases, but can be harmful when missingness is dependent on the outcome. Conclusion: We provide evidence that commonly taught principles of handling missing data via MI may not apply to CPMs, particularly when data can be missing at deployment. In such settings, RI and missing indicator methods can (marginally) outperform MI. As shown, the performance of the missing data handling method must be evaluated on a study-by-study basis, and should be based on whether missing data are allowed at deployment., Comment: 42 pages. Submitted to Statistical Methods in Medical Research in October 2021
Published: 2022

14. Metrics reloaded: Recommendations for image analysis validation

Author: Maier-Hein, Lena, Reinke, Annika, Godau, Patrick, Tizabi, Minu D., Buettner, Florian, Christodoulou, Evangelia, Glocker, Ben, Isensee, Fabian, Kleesiek, Jens, Kozubek, Michal, Reyes, Mauricio, Riegler, Michael A., Wiesenfarth, Manuel, Kavur, A. Emre, Sudre, Carole H., Baumgartner, Michael, Eisenmann, Matthias, Heckmann-Nötzel, Doreen, Rädsch, Tim, Acion, Laura, Antonelli, Michela, Arbel, Tal, Bakas, Spyridon, Benis, Arriel, Blaschko, Matthew, Cardoso, M. Jorge, Cheplygina, Veronika, Cimini, Beth A., Collins, Gary S., Farahani, Keyvan, Ferrer, Luciana, Galdran, Adrian, van Ginneken, Bram, Haase, Robert, Hashimoto, Daniel A., Hoffman, Michael M., Huisman, Merel, Jannin, Pierre, Kahn, Charles E., Kainmueller, Dagmar, Kainz, Bernhard, Karargyris, Alexandros, Karthikesalingam, Alan, Kenngott, Hannes, Kofler, Florian, Kopp-Schneider, Annette, Kreshuk, Anna, Kurc, Tahsin, Landman, Bennett A., Litjens, Geert, Madani, Amin, Maier-Hein, Klaus, Martel, Anne L., Mattson, Peter, Meijering, Erik, Menze, Bjoern, Moons, Karel G. M., Müller, Henning, Nichyporuk, Brennan, Nickel, Felix, Petersen, Jens, Rajpoot, Nasir, Rieke, Nicola, Saez-Rodriguez, Julio, Sánchez, Clara I., Shetty, Shravya, van Smeden, Maarten, Summers, Ronald M., Taha, Abdel A., Tiulpin, Aleksei, Tsaftaris, Sotirios A., Van Calster, Ben, Varoquaux, Gaël, and Jäger, Paul F.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international expert consortium created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. The framework was developed in a multi-stage Delphi process and is based on the novel concept of a problem fingerprint - a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), data set and algorithm output. Based on the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as a classification task at image, object or pixel level, namely image-level classification, object detection, semantic segmentation, and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool, which also provides a point of access to explore weaknesses, strengths and specific recommendations for the most common validation metrics. The broad applicability of our framework across domains is demonstrated by an instantiation for various biological and medical image analysis use cases., Comment: Shared first authors: Lena Maier-Hein, Annika Reinke. arXiv admin note: substantial text overlap with arXiv:2104.05642 Published in Nature Methods
Published: 2022
Full Text: View/download PDF

15. The influence of the dynamic context of the pandemic on the predictive performance of mortality predictions over time in older patients hospitalized for COVID-19

Author: Mooijaart, Simon P., Gussekloo, Jacobijn, Polinder-Bos, Harmke A., Moons, Karel G.M., van Smeden, Maarten, Peeters, Geeske, Melis, René J.F., Elders, Petra J.M., Festen, Jan, van der Linden, Carolien M.J., Jansen, Steffy W.M., Willems, Hanna C., van der Bo, Jessica M., van Raaij, Bas F.M., Zahra, Anum, Steyerberg, Ewout W., de Hond, Anne A.H., Smits, Rosalinde A.L., van der Klei, Veerle M.G.T.H., Minnema, Julia, Appelman, Brent, Smorenberg, Annemieke, Trompet, Stella, and Noordam, Raymond
Published: 2025
Full Text: View/download PDF

16. The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression

Author: Goorbergh, Ruben van den, van Smeden, Maarten, Timmerman, Dirk, and Van Calster, Ben
Subjects: Statistics - Methodology
Abstract: Methods to correct class imbalance, i.e. imbalance between the frequency of outcome events and non-events, are receiving increasing interest for developing prediction models. We examined the effect of imbalance correction on the performance of standard and penalized (ridge) logistic regression models in terms of discrimination, calibration, and classification. We examined random undersampling, random oversampling and SMOTE using Monte Carlo simulations and a case study on ovarian cancer diagnosis. The results indicated that all imbalance correction methods led to poor calibration (strong overestimation of the probability to belong to the minority class), but not to better discrimination in terms of the area under the receiver operating characteristic curve. Imbalance correction improved classification in terms of sensitivity and specificity, but similar results were obtained by shifting the probability threshold instead. Our study shows that outcome imbalance is not a problem in itself, and that imbalance correction may even worsen model performance., Comment: Main paper 21 pages, Supplement 53 pages
Published: 2022

17. Estimating uncertainty when providing individual cardiovascular risk predictions: a Bayesian survival analysis

Author: Hageman, Steven H.J., Post, Richard A.J., Visseren, Frank L.J., McEvoy, J. William, Jukema, J. Wouter, Smulders, Yvo, van Smeden, Maarten, and Dorresteijn, Jannick A.N.
Published: 2024
Full Text: View/download PDF

18. Understanding random resampling techniques for class imbalance correction and their consequences on calibration and discrimination of clinical risk prediction models

Author: Piccininni, Marco, Wechsung, Maximilian, Van Calster, Ben, Rohmann, Jessica L., Konigorski, Stefan, and van Smeden, Maarten
Published: 2024
Full Text: View/download PDF

19. SPIN-PM: a consensus framework to evaluate the presence of spin in studies on prediction models

Author: Andaur Navarro, Constanza L., Damen, Johanna A.A., Ghannad, Mona, Dhiman, Paula, van Smeden, Maarten, Reitsma, Johannes B., Collins, Gary S., Riley, Richard D., Moons, Karel G.M., and Hooft, Lotty
Published: 2024
Full Text: View/download PDF

20. Diabetes and risk of acute coronary syndrome in callers with chest discomfort: Cross-sectional study in out-of-hours primary care

Author: Spek, Michelle, Erkelens, Daphne C.A., van het Goor – van Wezep, Coralie, van Smeden, Maarten, Den Ruijter, Hester M., Wouters, Loes T.C.M., Venekamp, Roderick P., Rutten, Frans H., and Zwart, Dorien L.
Published: 2024
Full Text: View/download PDF

21. Evaluation of the Value of Waist Circumference and Metabolomics in the Estimation of Visceral Adipose Tissue.

Author: Boone, Sebastiaan C, van Smeden, Maarten, Rosendaal, Frits R, le Cessie, Saskia, Groenwold, Rolf HH, Jukema, J Wouter, van Dijk, Ko Willems, Lamb, Hildo J, Greenland, Philip, Neeland, Ian J, Allison, Matthew A, Criqui, Michael H, Budoff, Matthew J, Lind, Lars L, Kullberg, Joel, Ahlström, Håkan, Mook-Kanamori, Dennis O, and de Mutsert, Renée
Subjects: Adipose Tissue, Humans, Obesity, Body Mass Index, Prospective Studies, Middle Aged, Intra-Abdominal Fat, Waist Circumference, Metabolomics, added value, development, external validation, metabolomics, prediction, visceral adipose tissue, Cardiovascular, Atherosclerosis, Mathematical Sciences, Medical and Health Sciences, Epidemiology
Abstract: Visceral adipose tissue (VAT) is a strong prognostic factor for cardiovascular disease and a potential target for cardiovascular risk stratification. Because VAT is difficult to measure in clinical practice, we estimated prediction models with predictors routinely measured in general practice and VAT as outcome using ridge regression in 2,501 middle-aged participants from the Netherlands Epidemiology of Obesity study, 2008-2012. Adding waist circumference and other anthropometric measurements on top of the routinely measured variables improved the optimism-adjusted R2 from 0.50 to 0.58 with a decrease in the root-mean-square error (RMSE) from 45.6 to 41.5 cm2 and with overall good calibration. Further addition of predominantly lipoprotein-related metabolites from the Nightingale platform did not improve the optimism-corrected R2 and RMSE. The models were externally validated in 370 participants from the Prospective Investigation of Vasculature in Uppsala Seniors (PIVUS, 2006-2009) and 1,901 participants from the Multi-Ethnic Study of Atherosclerosis (MESA, 2000-2007). Performance was comparable to the development setting in PIVUS (R2 = 0.63, RMSE = 42.4 cm2, calibration slope = 0.94) but lower in MESA (R2 = 0.44, RMSE = 60.7 cm2, calibration slope = 0.75). Our findings indicate that the estimation of VAT with routine clinical measurements can be substantially improved by incorporating waist circumference but not by metabolite measurements.
Published: 2022

22. Predicting Benefit From FOLFOXIRI Plus Bevacizumab in Patients With Metastatic Colorectal Cancer

Author: Bond, Marinde J.G., van Smeden, Maarten, Degeling, Koen, Cremolini, Chiara, Schmoll, Hans-Joachim, Antoniotti, Carlotta, Lonardi, Sara, Murgioni, Sabina, Rossini, Daniele, Ibach, Stefan, Koopman, Miriam, Swijnenburg, Rutger-Jan, Punt, Cornelis J.A., May, Anne M., and Kwakman, Johannes J.M.
Published: 2024
Full Text: View/download PDF

23. Risk prediction models for discrete ordinal outcomes: calibration and the impact of the proportional odds assumption

Author: Edlinger, Michael, van Smeden, Maarten, Alber, Hannes F, Wanitschek, Maria, and Van Calster, Ben
Subjects: Statistics - Methodology
Abstract: Calibration is a vital aspect of the performance of risk prediction models, but research in the context of ordinal outcomes is scarce. This study compared calibration measures for risk models predicting a discrete ordinal outcome, and investigated the impact of the proportional odds assumption on calibration and overfitting. We studied the multinomial, cumulative, adjacent category, continuation ratio, and stereotype logit/logistic models. To assess calibration, we investigated calibration intercepts and slopes, calibration plots, and the estimated calibration index. Using large sample simulations, we studied the performance of models for risk estimation under various conditions, assuming that the true model has either a multinomial logistic form or a cumulative logit proportional odds form. Small sample simulations were used to compare the tendency for overfitting between models. As a case study, we developed models to diagnose the degree of coronary artery disease (five categories) in symptomatic patients. When the true model was multinomial logistic, proportional odds models often yielded poor risk estimates, with calibration slopes deviating considerably from unity even on large model development datasets. The stereotype logistic model improved the calibration slope, but still provided biased risk estimates for individual patients. When the true model had a cumulative logit proportional odds form, multinomial logistic regression provided biased risk estimates, although these biases were modest. Non-proportional odds models require more parameters to be estimated from the data, and hence suffered more from overfitting. Despite larger sample size requirements, we generally recommend multinomial logistic regression for risk prediction modeling of discrete ordinal outcomes., Comment: Revised version submitted to Statistics in Medicine
Published: 2021

24. Common Limitations of Image Processing Metrics: A Picture Story

Author: Reinke, Annika, Tizabi, Minu D., Sudre, Carole H., Eisenmann, Matthias, Rädsch, Tim, Baumgartner, Michael, Acion, Laura, Antonelli, Michela, Arbel, Tal, Bakas, Spyridon, Bankhead, Peter, Benis, Arriel, Blaschko, Matthew, Buettner, Florian, Cardoso, M. Jorge, Chen, Jianxu, Cheplygina, Veronika, Christodoulou, Evangelia, Cimini, Beth, Collins, Gary S., Engelhardt, Sandy, Farahani, Keyvan, Ferrer, Luciana, Galdran, Adrian, van Ginneken, Bram, Glocker, Ben, Godau, Patrick, Haase, Robert, Hamprecht, Fred, Hashimoto, Daniel A., Heckmann-Nötzel, Doreen, Hirsch, Peter, Hoffman, Michael M., Huisman, Merel, Isensee, Fabian, Jannin, Pierre, Kahn, Charles E., Kainmueller, Dagmar, Kainz, Bernhard, Karargyris, Alexandros, Karthikesalingam, Alan, Kavur, A. Emre, Kenngott, Hannes, Kleesiek, Jens, Kleppe, Andreas, Kohler, Sven, Kofler, Florian, Kopp-Schneider, Annette, Kooi, Thijs, Kozubek, Michal, Kreshuk, Anna, Kurc, Tahsin, Landman, Bennett A., Litjens, Geert, Madani, Amin, Maier-Hein, Klaus, Martel, Anne L., Mattson, Peter, Meijering, Erik, Menze, Bjoern, Moher, David, Moons, Karel G. M., Müller, Henning, Nichyporuk, Brennan, Nickel, Felix, Noyan, M. Alican, Petersen, Jens, Polat, Gorkem, Rafelski, Susanne M., Rajpoot, Nasir, Reyes, Mauricio, Rieke, Nicola, Riegler, Michael, Rivaz, Hassan, Saez-Rodriguez, Julio, Sánchez, Clara I., Schroeter, Julien, Saha, Anindo, Selver, M. Alper, Sharan, Lalith, Shetty, Shravya, van Smeden, Maarten, Stieltjes, Bram, Summers, Ronald M., Taha, Abdel A., Tiulpin, Aleksei, Tsaftaris, Sotirios A., Van Calster, Ben, Varoquaux, Gaël, Wiesenfarth, Manuel, Yaniv, Ziv R., Jäger, Paul, and Maier-Hein, Lena
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using specific metrics for a given image analysis task. These are typically related to (1) the disregard of inherent metric properties, such as the behaviour in the presence of class imbalance or small target structures, (2) the disregard of inherent data set properties, such as the non-independence of the test cases, and (3) the disregard of the actual biomedical domain interest that the metrics should reflect. This living dynamically document has the purpose to illustrate important limitations of performance metrics commonly applied in the field of image analysis. In this context, it focuses on biomedical image analysis problems that can be phrased as image-level classification, semantic segmentation, instance segmentation, or object detection task. The current version is based on a Delphi process on metrics conducted by an international consortium of image analysis experts from more than 60 institutions worldwide., Comment: Shared first authors: Annika Reinke and Minu D. Tizabi. This is a dynamic paper on limitations of commonly used metrics. It discusses metrics for image-level classification, semantic and instance segmentation, and object detection. For missing use cases, comments or questions, please contact a.reinke@dkfz.de. Substantial contributions to this document will be acknowledged with a co-authorship
Published: 2021

25. Tutorial: dos and don’ts in clinical prediction research for venous thromboembolism

Author: Nemeth, Banne, Smeets, Mark J.R., Cannegieter, Suzanne C., and van Smeden, Maarten
Published: 2024
Full Text: View/download PDF

26. External validation of six COVID-19 prognostic models for predicting mortality risk in older populations in a hospital, primary care, and nursing home setting

Author: Zahra, Anum, van Smeden, Maarten, Abbink, Evertine J., van den Berg, Jesse M., Blom, Marieke T., van den Dries, Carline J., Gussekloo, Jacobijn, Wouters, Fenne, Joling, Karlijn J., Melis, René, Mooijaart, Simon P., Peters, Jeannette B., Polinder-Bos, Harmke A., van Raaij, Bas F.M., Appelman, Brent, la Roi-Teeuw, Hannah M., Moons, Karel G.M., and Luijken, Kim
Published: 2024
Full Text: View/download PDF

27. Cross-institution natural language processing for reliable clinical association studies: a methodological exploration

Author: Sushil, Madhumita, Butte, Atul J., Schuit, Ewoud, van Smeden, Maarten, and Leeuwenberg, Artuur M.
Published: 2024
Full Text: View/download PDF

28. mecor: An R package for measurement error correction in linear regression models with a continuous outcome

Author: Nab, Linda, van Smeden, Maarten, Keogh, Ruth H., and Groenwold, Rolf H. H.
Subjects: Statistics - Methodology
Abstract: Measurement error in a covariate or the outcome of regression models is common, but is often ignored, even though measurement error can lead to substantial bias in the estimated covariate-outcome association. While several texts on measurement error correction methods are available, these methods remain seldomly applied. To improve the use of measurement error correction methodology, we developed mecor, an R package that implements measurement error correction methods for regression models with continuous outcomes. Measurement error correction requires information about the measurement error model and its parameters. This information can be obtained from four types of studies, used to estimate the parameters of the measurement error model: an internal validation study, a replicates study, a calibration study and an external validation study. In the package mecor, regression calibration methods and a maximum likelihood method are implemented to correct for measurement error in a continuous covariate in regression analyses. Additionally, methods of moments methods are implemented to correct for measurement error in the continuous outcome in regression analyses. Variance estimation of the corrected estimators is provided in closed form and using the bootstrap., Comment: 34 pages (including appendix), software package
Published: 2021

29. Comparing methods addressing multi-collinearity when developing prediction models

Author: Leeuwenberg, Artuur M., van Smeden, Maarten, Langendijk, Johannes A., van der Schaaf, Arjen, Mauer, Murielle E., Moons, Karel G. M., Reitsma, Johannes B., and Schuit, Ewoud
Subjects: Statistics - Methodology, 60, G.3
Abstract: Clinical prediction models are developed widely across medical disciplines. When predictors in such models are highly collinear, unexpected or spurious predictor-outcome associations may occur, thereby potentially reducing face-validity and explainability of the prediction model. Collinearity can be dealt with by exclusion of collinear predictors, but when there is no a priori motivation (besides collinearity) to include or exclude specific predictors, such an approach is arbitrary and possibly inappropriate. We compare different methods to address collinearity, including shrinkage, dimensionality reduction, and constrained optimization. The effectiveness of these methods is illustrated via simulations. In the conducted simulations, no effect of collinearity was observed on predictive outcomes. However, a negative effect of collinearity on the stability of predictor selection was found, affecting all compared methods, but in particular methods that perform strong predictor selection (e.g., Lasso).}
Published: 2021

30. Predicting adverse outcomes in adults with a community-acquired lower respiratory tract infection: a protocol for the development and validation of two prediction models for (i) all-cause hospitalisation and mortality and (ii) cardiovascular outcomes

Author: Rijk, Merijn H., Platteel, Tamara N., Geersing, Geert-Jan, Hollander, Monika, Dalmolen, Bert L. G. P., Little, Paul, Rutten, Frans H., van Smeden, Maarten, and Venekamp, Roderick P.
Published: 2023
Full Text: View/download PDF

31. Prognosis and prediction of antibiotic benefit in adults with clinically diagnosed acute rhinosinusitis: an individual participant data meta-analysis

Author: Hoogland, Jeroen, Takada, Toshihiko, van Smeden, Maarten, Rovers, Maroeska M., de Sutter, An I., Merenstein, Daniel, Kaiser, Laurent, Liira, Helena, Little, Paul, Bucher, Heiner C., Moons, Karel G. M., Reitsma, Johannes B., and Venekamp, Roderick P.
Published: 2023
Full Text: View/download PDF

32. A study protocol of external validation of eight COVID-19 prognostic models for predicting mortality risk in older populations in a hospital, primary care, and nursing home setting

Author: Zahra, Anum, Luijken, Kim, Abbink, Evertine J., van den Berg, Jesse M., Blom, Marieke T., Elders, Petra, Festen, Jan, Gussekloo, Jacobijn, Joling, Karlijn J., Melis, René, Mooijaart, Simon, Peters, Jeannette B., Polinder-Bos, Harmke A., van Raaij, Bas F. M., Smorenberg, Annemieke, la Roi-Teeuw, Hannah M., Moons, Karel G. M., and van Smeden, Maarten
Published: 2023
Full Text: View/download PDF

33. There is no such thing as a validated prediction model

Author: Van Calster, Ben, Steyerberg, Ewout W., Wynants, Laure, and van Smeden, Maarten
Published: 2023
Full Text: View/download PDF

34. Statistical Analysis—Measurement Error

Author: Brakenhoff, Timo B., van Smeden, Maarten, Oberski, Daniel L., Asselbergs, Folkert W., editor, Denaxas, Spiros, editor, Oberski, Daniel L., editor, and Moore, Jason H., editor
Published: 2023
Full Text: View/download PDF

35. Incomplete and possibly selective recording of signs, symptoms, and measurements in free text fields of primary care electronic health records of adults with lower respiratory tract infections

Author: Rijk, Merijn H., Platteel, Tamara N., Mulder, Marissa M.M., Geersing, Geert-Jan, Rutten, Frans H., van Smeden, Maarten, Venekamp, Roderick P., and Leeuwenberg, Tuur M.
Published: 2024
Full Text: View/download PDF

36. Accuracy of physicians’ intuitive risk estimation in the diagnostic management of pulmonary embolism: an individual patient data meta-analysis

Author: van Maanen, Rosanne, Martens, Emily S.L., Takada, Toshihiko, Roy, Pierre-Marie, de Wit, Kerstin, Parpia, Sameer, Kraaijpoel, Noémie, Huisman, Menno V., Wells, Philip S., Le Gal, Grégoire, Righini, Marc, Freund, Yonathan, Galipienzo, Javier, van Es, Nick, Blom, Jeanet W., Moons, Karel G.M., Rutten, Frans H., van Smeden, Maarten, Klok, Frederikus A., Geersing, Geert-Jan, and Luijken, Kim
Published: 2023
Full Text: View/download PDF

37. Poor handling of continuous predictors in clinical prediction models using logistic regression: a systematic review

Author: Ma, Jie, Dhiman, Paula, Qi, Cathy, Bullock, Garrett, van Smeden, Maarten, Riley, Richard D., and Collins, Gary S.
Published: 2023
Full Text: View/download PDF

38. Sensitivity analysis for bias due to a misclassfied confounding variable in marginal structural models

Author: Nab, Linda, Groenwold, Rolf H. H., van Smeden, Maarten, and Keogh, Ruth H.
Subjects: Statistics - Methodology
Abstract: In observational research treatment effects, the average treatment effect (ATE) estimator may be biased if a confounding variable is misclassified. We discuss the impact of classification error in a dichotomous confounding variable in analyses using marginal structural models estimated using inverse probability weighting (MSMs-IPW) and compare this with its impact in conditional regression models, focusing on a point-treatment study with a continuous outcome. Expressions were derived for the bias in the ATE estimator from a MSM-IPW and conditional model by using the potential outcome framework. Based on these expressions, we propose a sensitivity analysis to investigate and quantify the bias due to classification error in a confounding variable in MSMs-IPW. Compared to bias in the ATE estimator from a conditional model, the bias in MSM-IPW can be dissimilar in magnitude but the bias will always be equal in sign. A simulation study was conducted to study the finite sample performance of MSMs-IPW and conditional models if a confounding variable is misclassified. Simulation results showed that confidence intervals of the treatment effect obtained from MSM-IPW are generally wider and coverage of the true treatment effect is higher compared to a conditional model, ranging from over coverage if there is no classification error to smaller under coverage when there is classification error. The use of the bias expressions to inform a sensitivity analysis was demonstrated in a study of blood pressure lowering therapy. It is important to consider the potential impact of classification error in a confounding variable in studies of treatment effects and a sensitivity analysis provides an opportunity to quantify the impact of such errors on causal conclusions. An online tool for sensitivity analyses was developed: https://lindanab.shinyapps.io/SensitivityAnalysis., Comment: 25 pages, 3 figures, 3 tables
Published: 2019

39. On the variability of regression shrinkage methods for clinical prediction models: simulation study on predictive performance

Author: Van Calster, Ben, van Smeden, Maarten, and Steyerberg, Ewout W.
Subjects: Statistics - Methodology, 62J07
Abstract: When developing risk prediction models, shrinkage methods are recommended, especially when the sample size is limited. Several earlier studies have shown that the shrinkage of model coefficients can reduce overfitting of the prediction model and subsequently result in better predictive performance on average. In this simulation study, we aimed to investigate the variability of regression shrinkage on predictive performance for a binary outcome, with focus on the calibration slope. The slope indicates whether risk predictions are too extreme (slope < 1) or not extreme enough (slope > 1). We investigated the following shrinkage methods in comparison to standard maximum likelihood estimation: uniform shrinkage (likelihood-based and bootstrap-based), ridge regression, penalized maximum likelihood, LASSO regression, adaptive LASSO, non-negative garrote, and Firth's correction. There were three main findings. First, shrinkage improved calibration slopes on average. Second, the between-sample variability of calibration slopes was often increased relative to maximum likelihood. Among the shrinkage methods, the bootstrap-based uniform shrinkage worked well overall. In contrast to other shrinkage approaches, Firth's correction had only a small shrinkage effect but did so with low variability. Third, the correlation between the estimated shrinkage and the optimal shrinkage to remove overfitting was typically negative. Hence, although shrinkage improved predictions on average, it often worked poorly in individual datasets, in particular when shrinkage was most needed. The observed variability of shrinkage methods implies that these methods do not solve problems associated with small sample size or low number of events per variable., Comment: 138 pages (incl 114 supplementary pages). Main document: 5 figures and 2 tables
Published: 2019

40. How to conduct a systematic review and meta-analysis of prognostic model studies

Author: Damen, Johanna A.A., Moons, Karel G.M., van Smeden, Maarten, and Hooft, Lotty
Published: 2023
Full Text: View/download PDF

41. Noninvasive diagnostic work-up for suspected acute pulmonary embolism during pregnancy: a systematic review and meta-analysis of individual patient data

Author: Stals, Milou A.M., Moumneh, Thomas, Ainle, Fionnuala Ni, Aujesky, Drahomir, van Bemmel, Thomas, Bertoletti, Laurent, Bistervels, Ingrid M., Chauleur, Céline, Couturaud, Francis, van Dooren, Yordi P.A., Elias, Antoine, Faber, Laura M., Le Gall, Catherine, Hofstee, Herman M.A., van der Hulle, Tom, Kruip, Marieke J.H.A., Maignan, Maxime, Mairuhu, Albert T.A., Middeldorp, Saskia, Le Moigne, Emmanuelle, Nijkeuter, Mathilde, van der Pol, Liselotte M., Robert-Ebadi, Helia, Roy, Pierre-Marie, Sanchez, Olivier, Schmidt, Jeannot, van Smeden, Maarten, Tromeur, Cecile, Wolde, Marije ten, Righini, Marc, Le Gal, Grégoire, Huisman, Menno V., and Klok, Frederikus A.
Published: 2023
Full Text: View/download PDF

42. A weighting method for simultaneous adjustment for confounding and joint exposure-outcome misclassifications

Author: de Vries, Bas B. L. Penning, van Smeden, Maarten, and Groenwold, Rolf H. H.
Subjects: Statistics - Methodology
Abstract: Joint misclassification of exposure and outcome variables can lead to considerable bias in epidemiological studies of causal exposure-outcome effects. In this paper, we present a new maximum likelihood based estimator for the marginal causal odd-ratio that simultaneously adjusts for confounding and several forms of joint misclassification of the exposure and outcome variables. The proposed method relies on validation data for the construction of weights that account for both sources of bias. The weighting estimator, which is an extension of the exposure misclassification weighting estimator proposed by Gravel and Platt (Statistics in Medicine, 2018), is applied to reinfarction data. Simulation studies were carried out to study its finite sample properties and compare it with methods that do not account for confounding or misclassification. The new estimator showed favourable large sample properties in the simulations. Further research is needed to study the sensitivity of the proposed method and that of alternatives to violations of their assumptions. The implementation of the estimator is facilitated by a new R function in an existing R package., Comment: 36 pages, 7 tables, 1 figure
Published: 2019

43. Minimal reporting improvement after peer review in reports of COVID-19 prediction models: systematic review

Author: Hudda, Mohammed T., Archer, Lucinda, van Smeden, Maarten, Moons, Karel G.M., Collins, Gary S., Steyerberg, Ewout W., Wahlich, Charlotte, Reitsma, Johannes B., Riley, Richard D., Van Calster, Ben, and Wynants, Laure
Published: 2023
Full Text: View/download PDF

44. Systematic review identifies the design and methodological conduct of studies on machine learning-based prediction models

Author: Andaur Navarro, Constanza L., Damen, Johanna A.A., van Smeden, Maarten, Takada, Toshihiko, Nijman, Steven W.J., Dhiman, Paula, Ma, Jie, Collins, Gary S., Bajpai, Ram, Riley, Richard D., Moons, Karel G.M., and Hooft, Lotty
Published: 2023
Full Text: View/download PDF

45. Measurement error in continuous endpoints in randomised trials: problems and solutions

Author: Nab, Linda, Groenwold, Rolf H. H., Welsing, Paco M. J., and van Smeden, Maarten
Subjects: Statistics - Methodology
Abstract: In randomised trials, continuous endpoints are often measured with some degree of error. This study explores the impact of ignoring measurement error, and proposes methods to improve statistical inference in the presence of measurement error. Three main types of measurement error in continuous endpoints are considered: classical, systematic and differential. For each measurement error type, a corrected effect estimator is proposed. The corrected estimators and several methods for confidence interval estimation are tested in a simulation study. These methods combine information about error-prone and error-free measurements of the endpoint in individuals not included in the trial (external calibration sample). We show that if measurement error in continuous endpoints is ignored, the treatment effect estimator is unbiased when measurement error is classical, while Type-II error is increased at a given sample size. Conversely, the estimator can be substantially biased when measurement error is systematic or differential. In those cases, bias can largely be prevented and inferences improved upon using information from an external calibration sample, of which the required sample size increases as the strength of the association between the error-prone and error-free endpoint decreases. Measurement error correction using already a small (external) calibration sample is shown to improve inferences and should be considered in trials with error-prone endpoints. Implementation of the proposed correction methods is accommodated by a new software package for R., Comment: 37 pages, 4 figures, 3 tables
Published: 2018

46. Propensity score estimation using classification and regression trees in the presence of missing covariate data

Author: de Vries, Bas B. L. Penning, van Smeden, Maarten, and Groenwold, Rolf H. H.
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Data mining and machine learning techniques such as classification and regression trees (CART) represent a promising alternative to conventional logistic regression for propensity score estimation. Whereas incomplete data preclude the fitting of a logistic regression on all subjects, CART is appealing in part because some implementations allow for incomplete records to be incorporated in the tree fitting and provide propensity score estimates for all subjects. Based on theoretical considerations, we argue that the automatic handling of missing data by CART may however not be appropriate. Using a series of simulation experiments, we examined the performance of different approaches to handling missing covariate data; (i) applying the CART algorithm directly to the (partially) incomplete data, (ii) complete case analysis, and (iii) multiple imputation. Performance was assessed in terms of bias in estimating exposure-outcome effects \add{among the exposed}, standard error, mean squared error and coverage. Applying the CART algorithm directly to incomplete data resulted in bias, even in scenarios where data were missing completely at random. Overall, multiple imputation followed by CART resulted in the best performance. Our study showed that automatic handling of missing data in CART can cause serious bias and does not outperform multiple imputation as a means to account for missing data., Comment: 29 pages, 5 tables
Published: 2018

47. Impact of predictor measurement heterogeneity across settings on performance of prediction models: a measurement error perspective

Author: Luijken, Kim, Groenwold, Rolf H. H., van Calster, Ben, Steyerberg, Ewout W., and van Smeden, Maarten
Subjects: Statistics - Methodology, 97K80
Abstract: It is widely acknowledged that the predictive performance of clinical prediction models should be studied in patients that were not part of the data in which the model was derived. Out-of-sample performance can be hampered when predictors are measured differently at derivation and external validation. This may occur, for instance, when predictors are measured using different measurement protocols or when tests are produced by different manufacturers. Although such heterogeneity in predictor measurement between deriviation and validation data is common, the impact on the out-of-sample performance is not well studied. Using analytical and simulation approaches, we examined out-of-sample performance of prediction models under various scenarios of heterogeneous predictor measurement. These scenarios were defined and clarified using an established taxonomy of measurement error models. The results of our simulations indicate that predictor measurement heterogeneity can induce miscalibration of prediction and affects discrimination and overall predictive accuracy, to extents that the prediction model may no longer be considered clinically useful. The measurement error taxonomy was found to be helpful in identifying and predicting effects of heterogeneous predictor measurements between settings of prediction model derivation and validation. Our work indicates that homogeneity of measurement strategies across settings is of paramount importance in prediction research., Comment: 32 pages, 4 figures
Published: 2018

48. Exploring expressed concerns and uncanny feeling in patients with shortness of breath calling out-of-hours primary care.

Author: Spek, Michelle, Zwart, Dorien L., de Groot, Esther, Timmerman, Michelle R., van Smeden, Maarten, Erkelens, Daphne C. A., Dobbe, Anna S. M., Delissen, Mathé, Rutten, Frans H., and Venekamp, Roderick P.
Abstract: Background: Patients contacting out-of-hours primary care (OHS-PC) with shortness of breath (SOB) are often concerned. Sometimes, they also have an uncanny feeling; existential anxiety that something is wrong in their body. How concerns and uncanny feeling are related to critical medical conditions that cause SOB is unknown. We therefore explored the relation between expressed concerns and researcher's judged uncanny feeling among patients who contact OHS-PC for SOB with potential life-threatening events (LTEs) as the outcome. Methods: This is an explorative cross-sectional study. We analysed telephone triage conversations from patients with SOB who contacted Dutch OHS-PC between September 2020 and August 2021. We recorded whether patients expressed concerns and we judged whether patients had an uncanny feeling. We calculated odds ratios (ORs) for the association between (i) expressed concerns and (ii) uncanny feeling with the outcome potential LTEs. Results: Of the 1,843 patients with SOB, 43.6% patients expressed concerns and 33.0% had an uncanny feeling. Potential LTEs were similarly present among those who did and did not express concerns (OR: 1.07; 95% CI 0.84–1.37, mOR: 1.07; 95% CI 0.83–1.36), whereas potential LTEs were more often present among those with an uncanny feeling compared to those without such feeling (OR: 1.36; 95% CI 1.06–1.75, mOR: 1.35; 95% CI 1.05–1.74). Conclusions: Among patients who contacted OHS-PC with SOB, a perceived uncanny feeling of the patient was associated with a higher odd of potential LTEs, while patient's expressed concerns were not. Critical reflective interpretation is needed as uncanny feelings are difficult to judge. Nevertheless, our results implicate that further research into uncanny feelings in telephone triage could further improve the understanding of the relation with potential LTEs. Furthermore, this could be used to investigate how triage nurses may become more sensitive to what the patient is feeling but not explicitly saying such as by paying special attention to paralanguage. Trial registration: The Netherlands Trial Register, number: NL9682, registration date: 20–08-2021. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

49. Charlson comorbidity index has no incremental value for mortality risk prediction in nursing home residents with COVID-19 disease.

Author: Zahra, Anum, van Smeden, Maarten, Elders, Petra J. M., Festen, Jan, Gussekloo, Jacobijn, Joling, Karlijn J., van Loon, Anouk, Luijken, Kim, Melis, René J. F., Mooijaart, Simon P., Moons, Karel G. M., Peeters, Geeske, Polinder-Bos, Harmke A., Wouters, Fenne, and de Hond, Anne
Subjects: NURSING home residents, COVID-19 pandemic, OLDER people, COVID-19, PROGNOSIS, DEATH forecasting
Abstract: Background: During the COVID-19 pandemic, nursing home (NH) residents faced the highest risk of severe COVID-19 disease and mortality. Due to their frailty status, comorbidity burden can serve as a useful predictive indicator of vulnerability in this frail population. However, the prognostic value of these cumulative comorbidity scores like the Charlson comorbidity index (CCI) remained unclear in this population. We evaluated the incremental predictive value of the CCI for predicting 28-day mortality in NH residents with COVID-19, compared to prediction using age and sex only. Methods: We included older individuals of ≥ 70 years of age in a large retrospective observational cohort across NHs in the Netherlands. Individuals with PCR-confirmed COVID-19 diagnosis from 1 March 2020 to 31 December 2021 were included. The CCI score was computed by searching for the comorbidities recorded in the electronic patient records. All-cause mortality within 28 days was predicted using logistic regression based on age and sex only (base model) and by adding the CCI to the base model (CCI model). The predictive performance of the base model and the CCI model were compared visually by the distribution of predicted risks and area under the receiver operator characteristic curve (AUROC), scaled Brier score, and calibration slope. Results: A total of 4318 older NH residents were included in this study with a median age of 88 years [IQR: 83–93] and a median CCI score of 6 [IQR: 5–7]. 1357 (31%) residents died within 28 days after COVID-19 diagnosis. The base model, with age and sex as predictors, had an AUROC of 0.61 (CI: 0.60 to 0.63), a scaled brier score of 0.03 (CI: 0.02 to 0.04), and a calibration slope of 0.97 (CI: 0.83 to 1.13). The addition of CCI did not improve these predictive performance measures. Conclusion: The addition of the CCI as a vulnerability indicator did not improve short-term mortality prediction in NH residents. Similar (high) age and number of comorbidities in the NH population could reduce the effectiveness of these predictors, emphasizing the need for other population-specific predictors that can be utilized in the frail NH residents. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

50. Risk‐Based Decision Making: Estimands for Sequential Prediction Under Interventions.

Author: Luijken, Kim, Morzywołek, Paweł, van Amsterdam, Wouter, Cinà, Giovanni, Hoogland, Jeroen, Keogh, Ruth, Krijthe, Jesse H., Magliacane, Sara, van Ommen, Thijs, Peek, Niels, Putter, Hein, van Smeden, Maarten, Sperrin, Matthew, Wang, Junfeng, Weir, Daniala L., Didelez, Vanessa, and van Geloven, Nan
Abstract: Prediction models are used among others to inform medical decisions on interventions. Typically, individuals with high risks of adverse outcomes are advised to undergo an intervention while those at low risk are advised to refrain from it. Standard prediction models do not always provide risks that are relevant to inform such decisions: for example, an individual may be estimated to be at low risk because similar individuals in the past received an intervention which lowered their risk. Therefore, prediction models supporting decisions should target risks belonging to defined intervention strategies. Previous works on prediction under interventions assumed that the prediction model was used only at one time point to make an intervention decision. In clinical practice, intervention decisions are rarely made only once: they might be repeated, deferred, and reevaluated. This requires estimated risks under interventions that can be reconsidered at several potential decision moments. In the current work, we highlight key considerations for formulating estimands in sequential prediction under interventions that can inform such intervention decisions. We illustrate these considerations by giving examples of estimands for a case study about choosing between vaginal delivery and cesarean section for women giving birth. Our formalization of prediction tasks in a sequential, causal, and estimand context provides guidance for future studies to ensure that the right question is answered and appropriate causal estimation approaches are chosen to develop sequential prediction models that can inform intervention decisions. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

818 results on '"van Smeden, Maarten"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources