Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual prediction using SHapley Additive exPlanations.

Authors :
Zou, Ying
Shi, Yan
Sun, Fang
Liu, Jihua
Guo, Yu
Zhang, Huanlei
Lu, Xiudi
Gong, Yan
Xia, Shuang
Source :
Computer Methods & Programs in Biomedicine. Oct2022, Vol. 225, pN.PAG-N.PAG. 1p.
Publication Year :
2022

Abstract

• The constructed eXtreme gradient boosting model could individually assess central cervical lymph node metastasis in patients with papillary thyroid carcinoma.
• SHapley Additive exPlanations provided a rational explanation for predicting central cervical lymph node metastasis, including positive and negative effects.
• Due to visualization advantages, the prediction performance of the eXtreme gradient boosting model was better than that of traditional machine-learning models.

Central cervical lymph node metastasis (CLNM) is considered a risk factor for recurrence in patients with papillary thyroid carcinoma (PTC). Traditional machine-learning models suffer from the "black-box" problem and cannot clearly explain the interactive effects of the risk factors. We aimed to develop an eXtreme Gradient Boosting (XGBoost) model to assess CLNM and to explain the positive and negative effects of the risk factors.

A total of 1,122 patients with PTC admitted to Tianjin First Central Hospital from 2016 to 2020 were retrospectively selected and randomly divided into training and test datasets at an 8:2 ratio. A further 108 patients with PTC admitted to Binzhou Medical University Hospital in 2020 served as the validation dataset. The XGBoost model was used to assess CLNM. Ten-fold cross-validation was used for model selection, and classification performance was evaluated by the average area under the curve (AUC) across the 10 folds. SHapley Additive exPlanations (SHAP) and local interpretable model-agnostic explanations (LIME) were used to interpret the "black-box" model and to ensure its stability and reliability.

The XGBoost model based on ultrasound and dual-energy computed tomography images of the solitary primary lesion performed excellently in assessing CLNM, with average AUCs of 0.918, 0.903, and 0.881 in the training, test, and validation datasets, respectively. SHAP plots showed the influence of each parameter on the XGBoost model, including positive (i.e., capsular invasion, diameter, iodine concentration in the venous phase, and calcification) and negative (i.e., sex and age) impacts. Across all cases, capsular invasion had the highest prediction weight; for individual cases, different predictors were assigned different weights. Moreover, the XGBoost model outperformed classical machine-learning models.

This study developed and validated an XGBoost model for assessing CLNM in patients with PTC. The ability to visually interpret the positive and negative effects made the XGBoost model an effective tool for guiding clinical treatment. [ABSTRACT FROM AUTHOR]
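
To illustrate what the abstract means by positive and negative effects of individual predictors, the following is a minimal sketch of an exact Shapley value computation in plain Python. It is not the authors' model or pipeline: the scoring function `toy_risk_score`, its coefficients, the feature names, and the `patient` and `baseline` values are all invented for illustration, and a single baseline record stands in for the background data that SHAP would normally average over.

```python
from itertools import combinations
from math import factorial

# Hypothetical feature names loosely based on predictors named in the abstract.
FEATURES = ["capsular_invasion", "diameter_cm", "age_years"]


def toy_risk_score(x):
    """Invented scoring function for illustration only; NOT the published model."""
    score = 1.2 * x["capsular_invasion"] + 0.5 * x["diameter_cm"] - 0.02 * x["age_years"]
    # An interaction term, so that the attribution is not trivially additive.
    score += 0.3 * x["capsular_invasion"] * x["diameter_cm"]
    return score


def coalition_value(subset, x, baseline):
    """Score when features in `subset` take the patient's values and the rest take baseline values."""
    mixed = {f: (x[f] if f in subset else baseline[f]) for f in FEATURES}
    return toy_risk_score(mixed)


def shapley_values(x, baseline):
    """Exact Shapley values by enumerating every coalition of the other features."""
    n = len(FEATURES)
    phi = {}
    for f in FEATURES:
        others = [g for g in FEATURES if g != f]
        total = 0.0
        for k in range(n):
            for subset in combinations(others, k):
                s = set(subset)
                # Standard Shapley weight for a coalition of size k out of n players.
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                gain = coalition_value(s | {f}, x, baseline) - coalition_value(s, x, baseline)
                total += weight * gain
        phi[f] = total
    return phi


# Hypothetical patient and reference record (assumed values, not study data).
patient = {"capsular_invasion": 1, "diameter_cm": 2.0, "age_years": 50}
baseline = {"capsular_invasion": 0, "diameter_cm": 1.0, "age_years": 40}

phi = shapley_values(patient, baseline)
for name, contribution in phi.items():
    direction = "raises" if contribution > 0 else "lowers"
    print(f"{name}: {contribution:+.3f} ({direction} the score)")
```

The contributions sum to the difference between the patient's score and the baseline score, which is the property that lets a SHAP plot split one prediction into per-feature pushes up or down. Exact enumeration is only practical for a handful of features; for tree ensembles such as XGBoost, the SHAP library uses a faster tree-specific algorithm instead.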

Details

Language :
English
ISSN :
0169-2607
Volume :
225
Database :
Academic Search Index
Journal :
Computer Methods & Programs in Biomedicine
Publication Type :
Academic Journal
Accession number :
159217004
Full Text :
https://doi.org/10.1016/j.cmpb.2022.107038