1. Automatic magnetic resonance imaging series labelling for large repositories.
- Author
-
Gomis-Maya, Armando, Cerdá-Alberich, Leonor, Veiga-Canuto, Diana, Claudio-Fanni, Salvatore, Ten-Steve, Amadeo, Ribas-Despuig, Gloria, Mallol-Roselló, Pedro José, Vila-Frances, Joan, and Marti-Bonmati, Luis
- Subjects
MAGNETIC resonance imaging ,MACHINE learning ,ARTIFICIAL intelligence ,IMAGE processing ,DATA mining - Abstract
Large medical image repositories present challenges related to unstructured data. A data enrichment process allows the storage of additional information for fast identification of the content and properties of medical imaging studies. The aim of this study is to develop a metadata enrichment pipeline to facilitate the secondary use of medical images in a high-throughput environment. Our aim was to develop a categorization tool for the MR series to generate standardized tags that identify relevant image characteristics such as patient orientation, sequence type, weighting type, or the presence of fat suppression. Three models that make use of machine learning (ML) and DICOM tags are proposed. The dataset for their development consists of 4666 MR series from cancer patients, labeled by expert radiologists and acquired from different manufacturers, clinical centers and anatomical regions, covering as much variability as possible with the aim of making the models generalizable to other databases. Moreover, the inference performance of the end system has been evaluated on 25,596 MR series as well as the final model outputs with an external evaluation set of 1286 MR series. The weighting model achieves very reliable results with a f1 score of 88% in the validation set. Junk and chemical shift models achieved scores of 82% and 83% respectively. These results open the door to the automatic application of image post-processing and deep learning algorithms after accurate labeling, minimizing human intervention. Furthermore, the proposed solution can infer thousands of DICOM series in less than one minute. Thanks to the fast inference times provided by this solution, it fits well in a big data ecosystem, eliminating any performance issues on ingestion in a semi-real-time environment. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF