Back to Search Start Over

Differentiation between descending thoracic aortic diseases using machine learning and plasma proteomic signatures.

Authors :
Momenzadeh A
Kreimer S
Guo D
Ayres M
Berman D
Chyu KY
Shah PK
Milewicz D
Azizzadeh A
Meyer JG
Parker S
Source :
Clinical proteomics [Clin Proteomics] 2024 Jun 02; Vol. 21 (1), pp. 38. Date of Electronic Publication: 2024 Jun 02.
Publication Year :
2024

Abstract

Background: Descending thoracic aortic aneurysms and dissections can go undetected until severe and catastrophic, and few clinical indices exist to screen for aneurysms or predict risk of dissection.<br />Methods: This study generated a plasma proteomic dataset from 75 patients with descending type B dissection (Type B) and 62 patients with descending thoracic aortic aneurysm (DTAA). Standard statistical approaches were compared to supervised machine learning (ML) algorithms to distinguish Type B from DTAA cases. Quantitatively similar proteins were clustered based on linkage distance from hierarchical clustering and ML models were trained with uncorrelated protein lists across various linkage distances with hyperparameter optimization using fivefold cross validation. Permutation importance (PI) was used for ranking the most important predictor proteins of ML classification between disease states and the proteins among the top 10 PI protein groups were submitted for pathway analysis.<br />Results: Of the 1,549 peptides and 198 proteins used in this study, no peptides and only one protein, hemopexin (HPX), were significantly different at an adjusted pā€‰<ā€‰0.01 between Type B and DTAA cases. The highest performing model on the training set (Support Vector Classifier) and its corresponding linkage distance (0.5) were used for evaluation of the test set, yielding a precision-recall area under the curve of 0.7 to classify between Type B from DTAA cases. The five proteins with the highest PI scores were immunoglobulin heavy variable 6-1 (IGHV6-1), lecithin-cholesterol acyltransferase (LCAT), coagulation factor 12 (F12), HPX, and immunoglobulin heavy variable 4-4 (IGHV4-4). All proteins from the top 10 most important groups generated the following significantly enriched pathways in the plasma of Type B versus DTAA patients: complement activation, humoral immune response, and blood coagulation.<br />Conclusions: We conclude that ML may be useful in differentiating the plasma proteome of highly similar disease states that would otherwise not be distinguishable using statistics, and, in such cases, ML may enable prioritizing important proteins for model prediction.<br /> (© 2024. The Author(s).)

Details

Language :
English
ISSN :
1542-6416
Volume :
21
Issue :
1
Database :
MEDLINE
Journal :
Clinical proteomics
Publication Type :
Academic Journal
Accession number :
38825704
Full Text :
https://doi.org/10.1186/s12014-024-09487-4