1. Transcriptional Profiling and Machine Learning Unveil a Concordant Biosignature of Type I Interferon-Inducible Host Response Across Nasal Swab and Pulmonary Tissue for COVID-19 Diagnosis
- Author
-
Yibin Feng, Yigang Feng, Chi-Wing Tam, Cheng Zhang, and Ning Wang
- Subjects
diagnosis ,Immunology ,Interferome ,Biology ,Machine learning ,computer.software_genre ,Severity of Illness Index ,Virus ,Diagnosis, Differential ,Transcriptome ,COVID-19 Testing ,Interferon ,Nasopharynx ,medicine ,Humans ,Immunology and Allergy ,Gene Regulatory Networks ,Lung ,Respiratory Tract Infections ,Transcription factor ,Gene ,Original Research ,SARS-CoV-2 ,business.industry ,Gene Expression Profiling ,COVID-19 ,RC581-607 ,ISG15 ,machine learning ,Nasal Swab ,Interferon Type I ,type I interferon ,Artificial intelligence ,Immunologic diseases. Allergy ,business ,computer ,medicine.drug - Abstract
BackgroundCOVID-19, caused by SARS-CoV-2 virus, is a global pandemic with high mortality and morbidity. Limited diagnostic methods hampered the infection control. Since the direct detection of virus mainly by RT-PCR may cause false-negative outcome, host response-dependent testing may serve as a complementary approach for improving COVID-19 diagnosis.ObjectiveOur study discovered a highly-preserved transcriptional profile of Type I interferon (IFN-I)-dependent genes for COVID-19 complementary diagnosis.MethodsComputational language R-dependent machine learning was adopted for mining highly-conserved transcriptional profile (RNA-sequencing) across heterogeneous samples infected by SARS-CoV-2 and other respiratory infections. The transcriptomics/high-throughput sequencing data were retrieved from NCBI-GEO datasets (GSE32155, GSE147507, GSE150316, GSE162835, GSE163151, GSE171668, GSE182569). Mathematical approaches for homological analysis were as follows: adjusted rand index-related similarity analysis, geometric and multi-dimensional data interpretation, UpsetR, t-distributed Stochastic Neighbor Embedding (t-SNE), and Weighted Gene Co-expression Network Analysis (WGCNA). Besides, Interferome Database was used for predicting the transcriptional factors possessing IFN-I promoter-binding sites to the key IFN-I genes for COVID-19 diagnosis.ResultsIn this study, we identified a highly-preserved gene module between SARS-CoV-2 infected nasal swab and postmortem lung tissue regulating IFN-I signaling for COVID-19 complementary diagnosis, in which the following 14 IFN-I-stimulated genes are highly-conserved, including BST2, IFIT1, IFIT2, IFIT3, IFITM1, ISG15, MX1, MX2, OAS1, OAS2, OAS3, OASL, RSAD2, and STAT1. The stratified severity of COVID-19 may also be identified by the transcriptional level of these 14 IFN-I genes.ConclusionUsing transcriptional and computational analysis on RNA-seq data retrieved from NCBI-GEO, we identified a highly-preserved 14-gene transcriptional profile regulating IFN-I signaling in nasal swab and postmortem lung tissue infected by SARS-CoV-2. Such a conserved biosignature involved in IFN-I-related host response may be leveraged for COVID-19 diagnosis.
- Published
- 2021
- Full Text
- View/download PDF