Back to Search Start Over

The shape of cancer relapse: Topological data analysis predicts recurrence in paediatric acute lymphoblastic leukaemia.

Authors :
Salvador Chulián
Bernadette J Stolz
Álvaro Martínez-Rubio
Cristina Blázquez Goñi
Juan F Rodríguez Gutiérrez
Teresa Caballero Velázquez
Águeda Molinos Quintana
Manuel Ramírez Orellana
Ana Castillo Robleda
José Luis Fuster Soler
Alfredo Minguela Puras
María V Martínez Sánchez
María Rosa
Víctor M Pérez-García
Helen M Byrne
Source :
PLoS Computational Biology, Vol 19, Iss 8, p e1011329 (2023)
Publication Year :
2023
Publisher :
Public Library of Science (PLoS), 2023.

Abstract

Although children and adolescents with acute lymphoblastic leukaemia (ALL) have high survival rates, approximately 15-20% of patients relapse. Risk of relapse is routinely estimated at diagnosis by biological factors, including flow cytometry data. This high-dimensional data is typically manually assessed by projecting it onto a subset of biomarkers. Cell density and "empty spaces" in 2D projections of the data, i.e. regions devoid of cells, are then used for qualitative assessment. Here, we use topological data analysis (TDA), which quantifies shapes, including empty spaces, in data, to analyse pre-treatment ALL datasets with known patient outcomes. We combine these fully unsupervised analyses with Machine Learning (ML) to identify significant shape characteristics and demonstrate that they accurately predict risk of relapse, particularly for patients previously classified as 'low risk'. We independently confirm the predictive power of CD10, CD20, CD38, and CD45 as biomarkers for ALL diagnosis. Based on our analyses, we propose three increasingly detailed prognostic pipelines for analysing flow cytometry data from ALL patients depending on technical and technological availability: 1. Visual inspection of specific biological features in biparametric projections of the data; 2. Computation of quantitative topological descriptors of such projections; 3. A combined analysis, using TDA and ML, in the four-parameter space defined by CD10, CD20, CD38 and CD45. Our analyses readily extend to other haematological malignancies.

Subjects

Subjects :
Biology (General)
QH301-705.5

Details

Language :
English
ISSN :
1553734X and 15537358
Volume :
19
Issue :
8
Database :
Directory of Open Access Journals
Journal :
PLoS Computational Biology
Publication Type :
Academic Journal
Accession number :
edsdoj.98c5115ec1784f58bba7bc06fea347cf
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pcbi.1011329