1. DNA structural and physical properties reveal peculiarities in promoter sequences of the bacterium Escherichia coli K-12
- Author
-
Gustavo Sganzerla Martinez, Ernesto Pérez-Rueda, Aditya Kumar, and Scheila de Avila e Silva
- Subjects
Technology ,Science ,General Chemical Engineering ,General Physics and Astronomy ,Computational biology ,Biology ,medicine.disease_cause ,03 medical and health sciences ,chemistry.chemical_compound ,Base-pair stacking ,0302 clinical medicine ,Intergenic region ,Enthalpy ,Sigma factor ,Transcription factors ,medicine ,Coding region ,General Materials Science ,Nucleotide ,Binding site ,Escherichia coli ,Transcription factor ,030304 developmental biology ,General Environmental Science ,chemistry.chemical_classification ,0303 health sciences ,Bacteria ,General Engineering ,Free-stability ,Gene transcription ,chemistry ,General Earth and Planetary Sciences ,030217 neurology & neurosurgery ,DNA - Abstract
The gene transcription of bacteria starts with a promoter sequence being recognized by a transcription factor found in the RNAP enzyme, this process is assisted through the conservation of nucleotides as well as other factors governing these intergenic regions. Faced with this, the coding of genetic information into physical aspects of the DNA such as enthalpy, stability, and base-pair stacking could suggest promoter activity as well as protrude differentiation of promoter and non-promoter data. In this work, a total of 3131 promoter sequences associated to six different sigma factors in the bacterium E. coli were converted into numeric attributes, a strong set of control sequences referring to a shuffled version of the original sequences as well as coding regions is provided. Then, the parameterized genetic information was normalized, exhaustively analyzed through statistical tests. The results suggest that strong signals in the promoter sequences match the binding site of transcription factor proteins, indicating that promoter activity is well represented by its conversion into physical attributes. Moreover, the features tested in this report conveyed significant variances between promoter and control data, enabling these features to be employed in bacterial promoter classification. The results produced here may aid in bacterial promoter recognition by providing a robust set of biological inferences.
- Published
- 2021