Back to Search
Start Over
Geptop: A Gene Essentiality Prediction Tool for Sequenced Bacterial Genomes Based on Orthology and Phylogeny
- Source :
- PLoS ONE, PLoS ONE, Vol 8, Iss 8, p e72343 (2013)
- Publication Year :
- 2013
- Publisher :
- Public Library of Science (PLoS), 2013.
-
Abstract
- Integrative genomics predictors, which score highly in predicting bacterial essential genes, would be unfeasible in most species because the data sources are limited. We developed a universal approach and tool designated Geptop, based on orthology and phylogeny, to offer gene essentiality annotations. In a series of tests, our Geptop method yielded higher area under curve (AUC) scores in the receiver operating curves than the integrative approaches. In the ten-fold cross-validations among randomly upset samples, Geptop yielded an AUC of 0.918, and in the cross-organism predictions for 19 organisms Geptop yielded AUC scores between 0.569 and 0.959. A test applied to the very recently determined essential gene dataset from the Porphyromonas gingivalis, which belongs to a phylum different with all of the above 19 bacterial genomes, gave an AUC of 0.77. Therefore, Geptop can be applied to any bacterial species whose genome has been sequenced. Compared with the essential genes uniquely identified by the lethal screening, the essential genes predicted only by Gepop are associated with more protein-protein interactions, especially in the three bacteria with lower AUC scores (
- Subjects :
- Proteomics
Genome evolution
Gene prediction
lcsh:Medicine
Bacterial genome size
Biology
Gram-Positive Bacteria
Biochemistry
Microbiology
Genome
Bacterial Proteins
Genome Analysis Tools
Phylogenetics
Gram-Negative Bacteria
Protein Interaction Mapping
Genetics
Evolutionary Systematics
lcsh:Science
Theoretical Biology
Mathematical Computing
Gene
Phylogeny
Evolutionary Biology
Genes, Essential
Multidisciplinary
Phylum
lcsh:R
Statistics
Computational Biology
Reproducibility of Results
Molecular Sequence Annotation
Genomics
ROC Curve
Essential gene
Area Under Curve
lcsh:Q
Mathematics
Genome, Bacterial
Software
Research Article
Subjects
Details
- ISSN :
- 19326203
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- PLoS ONE
- Accession number :
- edsair.doi.dedup.....635f141a8ad8817f48602c5febc06c49
- Full Text :
- https://doi.org/10.1371/journal.pone.0072343