A network-based approach to classify the three domains of life
- Source :
- Biology Direct, Vol 6, Iss 1, p 53 (2011)
- Publisher :
- Springer Nature
Abstract
- Background: Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been shown to share domain-independent structural similarities, which pose a special challenge for our endeavour. We quantify specific structural information by using topological network descriptors to classify this set of metabolic networks. Such measures quantify the structural complexity of the underlying networks. In this study, we use such measures to capture domain-specific structural features of the metabolic networks to classify the data set. So far, it has been a challenging undertaking to examine what kind of structural complexity such measures do detect. In this paper, we apply two groups of topological network descriptors to metabolic networks and evaluate their classification performance. Moreover, we combine the two groups to perform a feature selection to estimate the structural features with the highest classification ability in order to optimize the classification performance.
Results: By combining the two groups, we can identify seven topological network descriptors that show a group-specific characteristic by ANOVA. A multivariate analysis using feature selection and supervised machine learning leads to a reasonable classification performance with a weighted F-score of 83.7% and an accuracy of 83.9%. We further demonstrate that our approach outperforms alternative methods. Also, our results reveal that entropy-based descriptors show the highest classification ability for this set of networks.
Conclusions: Our results show that these particular topological network descriptors are able to capture domain-specific structural characteristics for classifying metabolic networks between the three domains of life.
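The abstract names two ideas that a short sketch may make more concrete: an entropy-based topological descriptor and the weighted F-score used to report classification performance. The Python below is only an illustration under stated assumptions. The Shannon entropy of the degree distribution is one simple entropy-based measure chosen here for clarity, and the record does not say it is among the descriptors the authors evaluated. The function names `degree_entropy` and `weighted_f_score`, and the toy graphs and labels, are hypothetical.

```python
import math
from collections import Counter


def degree_entropy(edges):
    """Shannon entropy (in bits) of the degree distribution of an undirected graph.

    Assumption: the graph is given as a list of (u, v) edge pairs, so
    isolated vertices (degree 0) are not represented.
    """
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    # Count how many vertices share each degree value.
    counts = Counter(degree.values())
    n = sum(counts.values())
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def weighted_f_score(y_true, y_pred):
    """Per-class F1, averaged with weights proportional to each class's true count.

    Assumption: y_true and y_pred are equal-length lists of class labels.
    """
    total = len(y_true)
    score = 0.0
    for label in set(y_true):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += (y_true.count(label) / total) * f1
    return score


# Toy inputs (hypothetical): a triangle, where every vertex has degree 2,
# and a three-vertex path, where the degrees differ.
triangle = [("a", "b"), ("b", "c"), ("c", "a")]
path = [("a", "b"), ("b", "c")]
print(degree_entropy(triangle), degree_entropy(path))
```

A regular graph such as the triangle has a single degree value and therefore zero degree entropy, while a graph with mixed degrees scores higher. In a pipeline like the one the abstract describes, one such value per network and per descriptor would form the feature vector passed to feature selection and a supervised classifier.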
- Subjects :
- Topological information
Immunology
Feature selection
Biology
Machine learning
Bioinformatics
General Biochemistry, Genetics and Molecular Biology
Structural complexity
Set (abstract data type)
Computer graphics
Software
Artificial Intelligence
Three-domain system
lcsh:QH301-705.5
Ecology, Evolution, Behavior and Systematics
Analysis of Variance
Bacteria
Agricultural and Biological Sciences(all)
Biochemistry, Genetics and Molecular Biology(all)
Research
Applied Mathematics
Eukaryota
Reproducibility of Results
Archaea
Data set
Logistic Models
ComputingMethodologies_PATTERNRECOGNITION
lcsh:Biology (General)
Modeling and Simulation
General Agricultural and Biological Sciences
Algorithms
Metabolic Networks and Pathways
Details
- Language :
- English
- ISSN :
- 1745-6150
- Volume :
- 6
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- Biology Direct
- Accession number :
- edsair.doi.dedup.....a1c0ef9a7285c8a41f4baba514956cfd
- Full Text :
- https://doi.org/10.1186/1745-6150-6-53