Back to Search Start Over

Multi-modal transcriptomics: integrating machine learning and convolutional neural networks to identify immune biomarkers in atherosclerosis

Authors :
Haiqing Chen
Haotian Lai
Hao Chi
Wei Fan
Jinbang Huang
Shengke Zhang
Chenglu Jiang
Lai Jiang
Qingwen Hu
Xiuben Yan
Yemeng Chen
Jieying Zhang
Guanhu Yang
Bin Liao
Juyi Wan
Source :
Frontiers in Cardiovascular Medicine, Vol 11 (2024)
Publication Year :
2024
Publisher :
Frontiers Media S.A., 2024.

Abstract

BackgroundAtherosclerosis, a complex chronic vascular disorder with multifactorial etiology, stands as the primary culprit behind consequential cardiovascular events, imposing a substantial societal and economic burden. Nevertheless, our current understanding of its pathogenesis remains imprecise. In this investigation, our objective is to establish computational models elucidating molecular-level markers associated with atherosclerosis. This endeavor involves the integration of advanced machine learning techniques and comprehensive bioinformatics analyses.Materials and methodsOur analysis incorporated data from three publicly available the Gene Expression Omnibus (GEO) datasets: GSE100927 (104 samples, 30,558 genes), which includes atherosclerotic lesions and control arteries from carotid, femoral, and infra-popliteal arteries of deceased organ donors; GSE43292 (64 samples, 23,307 genes), consisting of paired carotid endarterectomy samples from 32 hypertensive patients, comparing atheroma plaques and intact tissues; and GSE159677 (30,498 single cells, 33,538 genes), examining single-cell transcriptomes of calcified atherosclerotic core plaques and adjacent carotid artery tissues from patients undergoing carotid endarterectomy. Utilizing single-cell sequencing, highly variable atherosclerotic monocyte subpopulations were systematically identified. We analyzed cellular communication patterns with temporal dynamics. The bioinformatics approach Weighted Gene Co—expression Network Analysis (WGCNA) identified key modules, constructing a Protein-Protein Interaction (PPI) network from module-associated genes. Three machine-learning models derived marker genes, formulated through logistic regression and validated via convolutional neural network(CNN) modeling. Subtypes were clustered based on Gene Set Variation Analysis (GSVA) scores, validated through immunoassays.ResultsThree pivotal atherosclerosis-associated genes—CD36, S100A10, CSNK1A1—were unveiled, offering valuable clinical insights. Profiling based on these genes delineated two distinct isoforms: C2 demonstrated potent microbicidal activity, while C1 engaged in inflammation regulation, tissue repair, and immune homeostasis. Molecular docking analyses explored therapeutic potential for Estradiol, Zidovudine, Indinavir, and Dronabinol for clinical applications.ConclusionThis study introduces three signature genes for atherosclerosis, shaping a novel paradigm for investigating clinical immunological medications. It distinguishes the high biocidal C2 subtype from the inflammation-modulating C1 subtype, utilizing identified signature gene as crucial targets.

Details

Language :
English
ISSN :
2297055X
Volume :
11
Database :
Directory of Open Access Journals
Journal :
Frontiers in Cardiovascular Medicine
Publication Type :
Academic Journal
Accession number :
edsdoj.68045dfb63af439797fd1e7aeba43da8
Document Type :
article
Full Text :
https://doi.org/10.3389/fcvm.2024.1397407