Back to Search Start Over

Hundreds of out-of-frame remodelled gene families in the E. coli pangenome

Authors :
Watson, Andrew
Lopez, Philippe
Bapteste, Eric
Institut de Systématique, Evolution, Biodiversité (ISYEB )
Muséum national d'Histoire naturelle (MNHN)-École pratique des hautes études (EPHE)
Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Université des Antilles (UA)
Source :
Molecular Biology and Evolution, Molecular Biology and Evolution, Oxford University Press (OUP), 2021, pp.msab329. ⟨10.1093/molbev/msab329/6430988⟩
Publication Year :
2021
Publisher :
HAL CCSD, 2021.

Abstract

International audience; All genomes include gene families with very limited taxonomic distributions that potentially represent new genes and innovations in protein-coding sequence, raising questions on the origins of such genes. Some of these genes are hypothesised to have formed de novo, from non-coding sequences, and recent work has begun to elucidate the processes by which de novo gene formation can occur. A special case of de novo gene formation, overprinting, describes the origin of new genes from non-coding alternative reading frames of existing open reading frames (ORFs). We argue that additionally, out-of-frame gene fission/fusion events of alternative reading frames of ORFs and out-of-frame lateral gene transfers could contribute to the origin of new gene families. To demonstrate this, we developed an original pattern-search in sequence similarity networks, enhancing the use of these graphs, commonly used to detect in-frame remodelled genes. We applied this approach to gene families in 524 complete genomes of Escherichia coli. We identified 767 gene families whose evolutionary history likely included at least one out-of-frame remodelling event. These genes with out-of-frame components represent 2.5% of all genes in the E. coli pangenome, suggesting that alternative reading frames of existing ORFs can contribute to a significant proportion of de novo genes in bacteria.

Subjects

Subjects :
[SDV]Life Sciences [q-bio]

Details

Language :
English
ISSN :
07374038 and 15371719
Database :
OpenAIRE
Journal :
Molecular Biology and Evolution, Molecular Biology and Evolution, Oxford University Press (OUP), 2021, pp.msab329. ⟨10.1093/molbev/msab329/6430988⟩
Accession number :
edsair.dedup.wf.001..e6a7aa7ab59b2c0f23ba338034c89683