Back to Search
Start Over
Developing prediction models to estimate the risk of two survival outcomes both occurring: A comparison of techniques
- Source :
- Pate, A, Sperrin, M, Riley, R D, Sergeant, J C, Van Staa, T, Peek, N, Mamas, M A, Lip, G Y H, O'Flaherty, M, Buchan, I & Martin, G P 2023, ' Developing prediction models to estimate the risk of two survival outcomes both occurring : A comparison of techniques ', Statistics in Medicine . https://doi.org/10.1002/sim.9771
- Publication Year :
- 2023
- Publisher :
- Wiley, 2023.
-
Abstract
- IntroductionThis study considers the prediction of the time until two survival outcomes have both occurred. We compared a variety of analytical methods motivated by a typical clinical problem of multimorbidity prognosis.MethodsWe considered five methods: product (multiply marginal risks), dual-outcome (directly model the time until both events occur), multistate models (msm), and a range of copula and frailty models. We assessed calibration and discrimination under a variety of simulated data scenarios, varying outcome prevalence, and the amount of residual correlation. The simulation focused on model misspecification and statistical power. Using data from the Clinical Practice Research Datalink, we compared model performance when predicting the risk of cardiovascular disease and type 2 diabetes both occurring.ResultsDiscrimination was similar for all methods. The product method was poorly calibrated in the presence of residual correlation. The msm and dual-outcome models were the most robust to model misspecification but suffered a drop in performance at small sample sizes due to overfitting, which the copula and frailty model were less susceptible to. The copula and frailty model's performance were highly dependent on the underlying data structure. In the clinical example, the product method was poorly calibrated when adjusting for 8 major cardiovascular risk factors.DiscussionWe recommend the dual-outcome method for predicting the risk of two survival outcomes both occurring. It was the most robust to model misspecification, although was also the most prone to overfitting. The clinical example motivates the use of the methods considered in this study.
Details
- ISSN :
- 10970258 and 02776715
- Database :
- OpenAIRE
- Journal :
- Statistics in Medicine
- Accession number :
- edsair.doi.dedup.....11e9b1003323886926919dd587d583c3
- Full Text :
- https://doi.org/10.1002/sim.9771