Back to Search
Start Over
Single-nucleotide conservation state annotation of the SARS-CoV-2 genome
- Source :
- Communications biology, vol 4, iss 1, bioRxiv, article-version (status) pre, article-version (number) 1, Communications Biology, Communications Biology, Vol 4, Iss 1, Pp 1-11 (2021)
- Publication Year :
- 2021
- Publisher :
- eScholarship, University of California, 2021.
-
Abstract
- Given the global impact and severity of COVID-19, there is a pressing need for a better understanding of the SARS-CoV-2 genome and mutations. Multi-strain sequence alignments of coronaviruses (CoV) provide important information for interpreting the genome and its variation. We apply a comparative genomics method, ConsHMM, to the multi-strain alignments of CoV to annotate every base of the SARS-CoV-2 genome with conservation states based on sequence alignment patterns among CoV. The learned conservation states show distinct enrichment patterns for genes, protein domains, and other regions of interest. Certain states are strongly enriched or depleted of SARS-CoV-2 mutations, which can be used to predict potentially consequential mutations. We expect the conservation states to be a resource for interpreting the SARS-CoV-2 genome and mutations.<br />Kwon and Ernst applied the comparative genomics method, ConsHMM, to the multi-strain alignments of different coronaviruses in order to annotate every base of the SARS-CoV-2 genome with conservation states. The conservation states reflect sequence alignment patterns among different coronaviruses, which would assist with understanding the functional consequences of SARS-CoV-2 mutations.
- Subjects :
- viruses
Medicine (miscellaneous)
Evolutionary biology
medicine.disease_cause
Genome
Conserved sequence
0302 clinical medicine
Viral
Biology (General)
Lung
Conserved Sequence
0303 health sciences
Mutation
Nucleotides
virus diseases
Genomics
Infectious Diseases
Sequence annotation
Pneumonia & Influenza
Infectious diseases
General Agricultural and Biological Sciences
Biotechnology
QH301-705.5
Evolution
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
Protein domain
Sequence alignment
Computational biology
Genome, Viral
Biology
General Biochemistry, Genetics and Molecular Biology
Article
Evolution, Molecular
Annotation
03 medical and health sciences
medicine
Genetics
Animals
Humans
Gene
030304 developmental biology
Sequence (medicine)
Comparative genomics
Base Sequence
SARS-CoV-2
fungi
Human Genome
Molecular
COVID-19
Pneumonia
biochemical phenomena, metabolism, and nutrition
respiratory tract diseases
Emerging Infectious Diseases
Sequence Alignment
030217 neurology & neurosurgery
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Communications biology, vol 4, iss 1, bioRxiv, article-version (status) pre, article-version (number) 1, Communications Biology, Communications Biology, Vol 4, Iss 1, Pp 1-11 (2021)
- Accession number :
- edsair.doi.dedup.....d84202c6d8ef87e9fbed4c474586f7ce