Back to Search Start Over

New assembly, reannotation and analysis of the Entamoeba histolytica genome reveal new genomic features and protein content information.

Authors :
Hernan A Lorenzi
Daniela Puiu
Jason R Miller
Lauren M Brinkac
Paolo Amedeo
Neil Hall
Elisabet V Caler
Source :
PLoS Neglected Tropical Diseases, Vol 4, Iss 6, p e716 (2010)
Publication Year :
2010
Publisher :
Public Library of Science (PLoS), 2010.

Abstract

In order to maintain genome information accurately and relevantly, original genome annotations need to be updated and evaluated regularly. Manual reannotation of genomes is important as it can significantly reduce the propagation of errors and consequently diminishes the time spent on mistaken research. For this reason, after five years from the initial submission of the Entamoeba histolytica draft genome publication, we have re-examined the original 23 Mb assembly and the annotation of the predicted genes.The evaluation of the genomic sequence led to the identification of more than one hundred artifactual tandem duplications that were eliminated by re-assembling the genome. The reannotation was done using a combination of manual and automated genome analysis. The new 20 Mb assembly contains 1,496 scaffolds and 8,201 predicted genes, of which 60% are identical to the initial annotation and the remaining 40% underwent structural changes. Functional classification of 60% of the genes was modified based on recent sequence comparisons and new experimental data. We have assigned putative function to 3,788 proteins (46% of the predicted proteome) based on the annotation of predicted gene families, and have identified 58 protein families of five or more members that share no homology with known proteins and thus could be entamoeba specific. Genome analysis also revealed new features such as the presence of segmental duplications of up to 16 kb flanked by inverted repeats, and the tight association of some gene families with transposable elements.This new genome annotation and analysis represents a more refined and accurate blueprint of the pathogen genome, and provides an upgraded tool as reference for the study of many important aspects of E. histolytica biology, such as genome evolution and pathogenesis.

Details

Language :
English
ISSN :
19352727 and 19352735
Volume :
4
Issue :
6
Database :
Directory of Open Access Journals
Journal :
PLoS Neglected Tropical Diseases
Publication Type :
Academic Journal
Accession number :
edsdoj.7a63ab065e1a4aea96f33f13e6d6e192
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pntd.0000716