Back to Search
Start Over
Development and Internal Validation of Machine Learning Algorithms for Preoperative Survival Prediction of Extremity Metastatic Disease
- Source :
- Clin Orthop Relat Res, Clinical orthopaedics and related research, 478(2), 322-333. Springer New York
- Publication Year :
- 2020
-
Abstract
- Background A preoperative estimation of survival is critical for deciding on the operative management of metastatic bone disease of the extremities. Several tools have been developed for this purpose, but there is room for improvement. Machine learning is an increasingly popular and flexible method of prediction model building based on a data set. It raises some skepticism, however, because of the complex structure of these models. Questions/purposes The purposes of this study were (1) to develop machine learning algorithms for 90-day and 1-year survival in patients who received surgical treatment for a bone metastasis of the extremity, and (2) to use these algorithms to identify those clinical factors (demographic, treatment related, or surgical) that are most closely associated with survival after surgery in these patients. Methods All 1090 patients who underwent surgical treatment for a long-bone metastasis at two institutions between 1999 and 2017 were included in this retrospective study. The median age of the patients in the cohort was 63 years (interquartile range [IQR] 54 to 72 years), 56% of patients (610 of 1090) were female, and the median BMI was 27 kg/m (IQR 23 to 30 kg/m). The most affected location was the femur (70%), followed by the humerus (22%). The most common primary tumors were breast (24%) and lung (23%). Intramedullary nailing was the most commonly performed type of surgery (58%), followed by endoprosthetic reconstruction (22%), and plate screw fixation (14%). Missing data were imputed using the missForest methods. Features were selected by random forest algorithms, and five different models were developed on the training set (80% of the data): stochastic gradient boosting, random forest, support vector machine, neural network, and penalized logistic regression. These models were chosen as a result of their classification capability in binary datasets. Model performance was assessed on both the training set and the validation set (20% of the data) by discrimination, calibration, and overall performance. Results We found no differences among the five models for discrimination, with an area under the curve ranging from 0.86 to 0.87. All models were well calibrated, with intercepts ranging from -0.03 to 0.08 and slopes ranging from 1.03 to 1.12. Brier scores ranged from 0.13 to 0.14. The stochastic gradient boosting model was chosen to be deployed as freely available web-based application and explanations on both a global and an individual level were provided. For 90-day survival, the three most important factors associated with poorer survivorship were lower albumin level, higher neutrophil-to-lymphocyte ratio, and rapid growth primary tumor. For 1-year survival, the three most important factors associated with poorer survivorship were lower albumin level, rapid growth primary tumor, and lower hemoglobin level. Conclusions Although the final models must be externally validated, the algorithms showed good performance on internal validation. The final models have been incorporated into a freely accessible web application that can be found at https://sorg-apps.shinyapps.io/extremitymetssurvival/. Pending external validation, clinicians may use this tool to predict survival for their individual patients to help in shared treatment decision making. Level of evidence Level III, therapeutic study.
- Subjects :
- Male
Time Factors
Clinical Decision-Making
Other Features
Bone Neoplasms
Machine learning
computer.software_genre
Logistic regression
Risk Assessment
Decision Support Techniques
Machine Learning
Interquartile range
Predictive Value of Tests
Risk Factors
Survivorship curve
Medicine
Humans
Orthopedics and Sports Medicine
Orthopedic Procedures
Aged
Retrospective Studies
business.industry
Patient Selection
Reproducibility of Results
Retrospective cohort study
General Medicine
Middle Aged
Missing data
medicine.disease
Primary tumor
Treatment Outcome
Predictive value of tests
Cohort
Surgery
Female
Artificial intelligence
business
computer
Algorithm
Boston
Subjects
Details
- Language :
- English
- ISSN :
- 0009921X
- Volume :
- 478
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- Clinical orthopaedics and related research
- Accession number :
- edsair.doi.dedup.....39b38f1ff48c2e83e03ac7462c9c7a1e