Phylogenetic Analysis: A Novel Method of Protein Sequence Similarity Analysis.

Authors :
Li, Wei
Yang, Lina
Meng, Zuqiang
Qiu, Yu
Wang, Patrick Shen-Pei
Li, Xichun
Source :
International Journal of Pattern Recognition & Artificial Intelligence. Aug 2022, Vol. 36 Issue 10, p1-25. 25p.
Publication Year :
2022

Abstract

Protein sequence similarity analysis (PSSA) is a significant task in bioinformatics: it can reveal information about unknown sequences, such as protein structures and homology relationships. A protein sequence is the series of amino acids, each with rich physical and chemical properties, that forms the basic structure of a protein. However, similarity analysis and phylogenetic analysis across species with complex amino acid sequences remain challenging. In this paper, nine properties of amino acids are considered and each sequence is converted into numerical values by principal component analysis (PCA). Using the Haar wavelet transform and the Higuchi fractal dimension (HFD), a new feature vector is constructed to represent the sequence. The Spearman distance is used to compute the distance matrix, from which the phylogenetic tree is constructed. Two representative protein data sets, 9 ND5 (NADH dehydrogenase 5) sequences and 8 ND6 (NADH dehydrogenase 6) sequences, were selected for similarity and phylogenetic analysis, and the results were compared with MEGA software and other existing methods. The results show that the method outperforms the compared methods and is consistent with known facts. [ABSTRACT FROM AUTHOR]
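
The pipeline named in the abstract (property table, PCA, Haar wavelet, HFD, Spearman distance) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not list the nine properties, the wavelet depth, or how the feature vector is assembled. The property table below is therefore a seeded random placeholder, the sequences are synthetic, and the feature layout (HFD of the signal plus HFD of two levels of Haar coefficients) and all function names are assumptions made for illustration.

```python
import numpy as np

AA = "ACDEFGHIKLMNPQRSTVWY"

def haar_step(x):
    """One level of the Haar transform: (approximation, detail) coefficients."""
    if len(x) % 2:
        x = np.append(x, x[-1])  # pad odd-length input by repeating the last sample
    even, odd = x[0::2], x[1::2]
    return (even + odd) / np.sqrt(2), (even - odd) / np.sqrt(2)

def higuchi_fd(x, kmax=4):
    """Higuchi fractal dimension: slope of log L(k) against log(1/k)."""
    n = len(x)
    log_inv_k, log_len = [], []
    for k in range(1, kmax + 1):
        lengths = []
        for m in range(k):
            idx = np.arange(m, n, k)      # subsampled series x[m], x[m+k], ...
            steps = len(idx) - 1
            if steps < 1:
                continue
            norm = (n - 1) / (steps * k)  # Higuchi's normalisation factor
            lengths.append(np.abs(np.diff(x[idx])).sum() * norm / k)
        log_inv_k.append(np.log(1.0 / k))
        log_len.append(np.log(np.mean(lengths)))
    slope, _ = np.polyfit(log_inv_k, log_len, 1)
    return slope

def features(seq, aa_value, levels=2):
    """Assumed feature layout: HFD of the signal and of each Haar sub-band."""
    signal = np.array([aa_value[a] for a in seq])
    feats = [higuchi_fd(signal)]
    approx = signal
    for _ in range(levels):
        approx, detail = haar_step(approx)
        feats += [higuchi_fd(approx), higuchi_fd(detail)]
    return np.array(feats)

def spearman_distance(u, v):
    """1 - Spearman rank correlation (ranks via double argsort, ties ignored)."""
    ru = np.argsort(np.argsort(u)).astype(float)
    rv = np.argsort(np.argsort(v)).astype(float)
    return 1.0 - np.corrcoef(ru, rv)[0, 1]

rng = np.random.default_rng(0)

# Placeholder 20 x 9 property table (NOT the paper's data), reduced to one
# value per amino acid by projecting onto the first principal component.
props = rng.normal(size=(20, 9))
Z = (props - props.mean(axis=0)) / props.std(axis=0)
_, _, Vt = np.linalg.svd(Z, full_matrices=False)
aa_value = dict(zip(AA, Z @ Vt[0]))

# Synthetic sequences: seq_b differs from seq_a only in its last 4 residues.
seq_a = "".join(rng.choice(list(AA), size=64))
seq_b = seq_a[:60] + "".join(rng.choice(list(AA), size=4))
seq_c = "".join(rng.choice(list(AA), size=64))

F = np.array([features(s, aa_value) for s in (seq_a, seq_b, seq_c)])
D = np.array([[spearman_distance(a, b) for b in F] for a in F])
print(np.round(D, 3))
```

The resulting matrix `D` is symmetric with a zero diagonal and could be passed to any distance-based tree builder; the abstract does not say which tree construction method the paper uses.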

Details

Language :
English
ISSN :
02180014
Volume :
36
Issue :
10
Database :
Academic Search Index
Journal :
International Journal of Pattern Recognition & Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
159106403
Full Text :
https://doi.org/10.1142/S0218001422580071