1. Sources of variation in multicenter rectal MRI data and their effect on radiomics feature reproducibility
- Author
-
Simon van Kranen, Gerlof Bosma, Frans C. H. Bakers, Shira de Bie, Remy W F Geenen, Cornelis J Veeken, Gerald Peterson, Francesca Castagnoli, Doenja M. J. Lambregts, Najim El Khababi, Sander Roberti, Regina G. H. Beets-Tan, Roy F. A. Vliegen, Joost J. M. van Griethuysen, Niels W. Schurink, Vincent C. Cappendijk, Nino Bogveradze, P.A. Neijenhuis, RS: GROW - R3 - Innovative Cancer Diagnostics & Therapy, Beeldvorming, School Office GROW, and MUMC+: DA BV Medisch Specialisten Radiologie (9)
- Subjects
Reproducibility of results ,medicine.medical_specialty ,Reproducibility ,Intraclass correlation ,business.industry ,PREDICTION ,Feature extraction ,Image processing ,Rectal neoplasms ,General Medicine ,Repeatability ,CANCER ,Multicenter study ,Magnetic resonance imaging ,Feature (computer vision) ,Linear regression ,medicine ,Radiology, Nuclear Medicine and imaging ,Segmentation ,Computer-assisted ,Radiology ,Nuclear medicine ,business ,REPEATABILITY - Abstract
Objectives To investigate sources of variation in a multicenter rectal cancer MRI dataset focusing on hardware and image acquisition, segmentation methodology, and radiomics feature extraction software. Methods T2W and DWI/ADC MRIs from 649 rectal cancer patients were retrospectively acquired in 9 centers. Fifty-two imaging features (14 first-order/6 shape/32 higher-order) were extracted from each scan using whole-volume (expert/non-expert) and single-slice segmentations using two different software packages (PyRadiomics/CapTk). Influence of hardware, acquisition, and patient-intrinsic factors (age/gender/cTN-stage) on ADC was assessed using linear regression. Feature reproducibility was assessed between segmentation methods and software packages using the intraclass correlation coefficient. Results Image features differed significantly (p Conclusions Significant variations are present in multicenter MRI data, particularly related to differences in hardware and acquisition, which will likely negatively influence subsequent analysis if not corrected for. Segmentation variations had a minor impact when using whole volume segmentations. Between software packages, higher-order features were less reproducible and caution is warranted when implementing these in prediction models. Key Points • Features derived from T2W-MRI and in particular ADC differ significantly between centers when performing multicenter data analysis. • Variations in ADC are mainly (> 60%) caused by hardware and image acquisition differences and less so ( • Features derived using different image segmentations (expert/non-expert) were reproducible, provided that whole-volume segmentations were used. When using different feature extraction software packages with similar settings, higher-order features were less reproducible.
- Published
- 2022