18 results on '"Smith–Waterman algorithm"'
Search Results
2. Searching for remote homologs of CAML among eukaryotes
- Author
-
Nica Borgese
- Subjects
Saccharomyces cerevisiae Proteins ,Saccharomyces cerevisiae ,Computational biology ,Biology ,Endoplasmic Reticulum ,Biochemistry ,03 medical and health sciences ,0302 clinical medicine ,Structural Biology ,Genetics ,Homologous chromosome ,Animals ,Guanine Nucleotide Exchange Factors ,Humans ,Molecular Biology ,Cyclophilin ,030304 developmental biology ,computer.programming_language ,Adenosine Triphosphatases ,Smith–Waterman algorithm ,0303 health sciences ,Multiple sequence alignment ,Caml ,Phylum ,Intracellular Membranes ,Cell Biology ,Transmembrane protein ,Cell biology ,computer ,030217 neurology & neurosurgery ,Function (biology) ,Protein Binding - Abstract
The tryptophan rich basic protein/calcium signal-modulating cyclophilin ligand (WRB/CAML) and Get1p/Get2p complexes, in vertebrates and yeast, respectively, mediate the final step of tail-anchored protein insertion into the endoplasmic reticulum membrane via the Get pathway. While WRB appears to exist in all eukaryotes, CAML homologs were previously recognized only among chordates, raising the question as to how CAML's function is performed in other phyla. Furthermore, whereas WRB was recognized as the metazoan homolog of Get1, CAML and Get2, although functionally equivalent, were not considered to be homologous. CAML contains an N-terminal basic, TRC40/Get3-interacting, region, three transmembrane segments near the C-terminus, and a poorly conserved region between these domains. Here, I searched the NCBI protein database for remote CAML homologs in all eukaryotes, using position-specific iterated-basic local alignment search tool, with the C-terminal, the N-terminal or the full-length sequence of human CAML as query. The N-terminal basic region and full-length CAML retrieved homologs among metazoa, plants and fungi. In the latter group several hits were annotated as GET2. The C-terminal query did not return entries outside of the animal kingdom, but did retrieve over one hundred invertebrate metazoan CAML-like proteins, which all conserved the N-terminal TRC40-binding domain. The results indicate that CAML homologs exist throughout the eukaryotic domain of life, and suggest that metazoan CAML and yeast GET2 share a common evolutionary origin. They further reveal a tight link between the particular features of the metazoan membrane-anchoring domain and the TRC40-interacting region. The list of sequences presented here should provide a useful resource for future studies addressing structure-function relationships in CAML proteins.
- Published
- 2020
3. A parallel hash‐based method for local sequence alignment
- Author
-
Nezarat Amin, Emadi Sima, Ghaffari Mohammad Reza, and Aghaee-Meybodi Esmat
- Subjects
Smith–Waterman algorithm ,Local sequence alignment ,0303 health sciences ,Computer Networks and Communications ,Computer science ,0206 medical engineering ,Hash function ,Sequence alignment ,02 engineering and technology ,String searching algorithm ,DNA sequencing ,Hash table ,Computer Science Applications ,Theoretical Computer Science ,03 medical and health sciences ,Computational Theory and Mathematics ,Algorithm ,020602 bioinformatics ,Software ,030304 developmental biology - Published
- 2021
4. Coherent point drift peak alignment algorithms using distance and similarity measures for two‐dimensional gas chromatography mass spectrometry data
- Author
-
Zichun Zhong, Seongho Kim, Ikuko Kato, Zeyu Li, Sikai Zhong, and Xiang Zhang
- Subjects
Smith–Waterman algorithm ,Physics ,Biological data ,Applied Mathematics ,010401 analytical chemistry ,Point set registration ,02 engineering and technology ,Mass spectrometry ,01 natural sciences ,Measure (mathematics) ,Article ,0104 chemical sciences ,Analytical Chemistry ,020401 chemical engineering ,Similarity (network science) ,Preprocessor ,0204 chemical engineering ,F1 score ,Algorithm - Abstract
The peak alignment is a vital preprocessing step before downstream analysis, such as biomarker discovery and pathway analysis, for two-dimensional gas chromatography mass spectrometry (2DGCMS)-based metabolomics data. Due to uncontrollable experimental conditions, e.g., the differences in temperature or pressure, matrix effects on samples, and stationary phase degradation, a shift of retention times among samples inevitably occurs during 2DGCMS experiments, making it difficult to align peaks. Various peak alignment algorithms have been developed to correct retention time shifts for homogeneous, heterogeneous or both type of mass spectrometry data. However, almost all existing algorithms have been focused on a local alignment and are suffering from low accuracy especially when aligning dense biological data with many peaks. We have developed four global peak alignment (GPA) algorithms using coherent point drift (CPD) point matching algorithms: retention time-based CPD-GPA (RT), prior CPD-GPA (P), mixture CPD-GPA (M), and prior mixture CPD-GPA (P+M). The method RT performs the peak alignment based only on the retention time distance, while the methods P, M, and P+M carry out the peak alignment using both the retention time distance and mass spectral similarity. The method P incorporates the mass spectral similarity through prior information and the methods M and P+M use the mixture distance measure. Four developed algorithms are applied to homogeneous and heterogeneous spiked-in data as well as two real biological data and compared with three existing algorithms, mSPA, SWPA, and BiPACE-2D. The results show that our CPD-GPA algorithms perform better than all existing algorithms in terms of F1 score.
- Published
- 2020
5. Global existence and uniqueness of measure valued solutions to a Vlasov-type equation with local alignment
- Author
-
Yu Gao and Xiaoping Xue
- Subjects
Smith–Waterman algorithm ,0209 industrial biotechnology ,General Mathematics ,Mathematical analysis ,General Engineering ,Particle method ,02 engineering and technology ,01 natural sciences ,Measure (mathematics) ,010101 applied mathematics ,Type equation ,020901 industrial engineering & automation ,Uniqueness ,0101 mathematics ,Flocking (texture) ,Mathematics - Published
- 2017
6. An energy-aware performance analysis of SWIMM:Smith-Waterman implementation onIntel'sMulticore andManycore architectures
- Author
-
Enzo Rucci, Marcelo Naiouf, Carlos García, Armando Eduardo De Giusti, Guillermo Botella, and Manuel Prieto-Matias
- Subjects
Smith–Waterman algorithm ,Multi-core processor ,Coprocessor ,Xeon ,Computer Networks and Communications ,Computer science ,Symmetric multiprocessor system ,Parallel computing ,Computer Science Applications ,Theoretical Computer Science ,Instruction set ,Computational Theory and Mathematics ,Computer architecture ,Time complexity ,Software ,Xeon Phi - Abstract
Alignment is essential in many areas such as biological, chemical and criminal forensics. The well-known Smith-Waterman SW algorithm is able to retrieve the optimal local alignment with quadratic time and space complexity. There are several implementations that take advantage of computing parallelization, such as manycores, FPGAs or GPUs, in order to reduce the alignment effort. In this research, we adapt, develop and tune the SW algorithm named SWIMM on a heterogeneous platform based on Intel's Xeon and Xeon Phi coprocessor. SWIMM is a free tool available in a public git repositoryi¾?https://github.com/enzorucci/SWIMM. We efficiently exploit data and thread-level parallelism, reaching up to 380 GCUPS on heterogeneous architecture, 350 GCUPS for the isolated Xeon and 50 GCUPS on Xeon Phi. Despite the heterogeneous implementation obtaining the best performance, it is also the most energy-demanding. In fact, we also present a trade-off analysis between performance and power consumption. The greenest configuration is based on an isolated multicore system that exploits AVX2 instruction set architecture reaching 1.5 GCUPS/Watts. Copyright © 2015 John Wiley & Sons, Ltd.
- Published
- 2015
7. GSWABE: faster GPU-accelerated sequence alignment with optimal alignment retrieval for short DNA sequences
- Author
-
Yongchao Liu and Bertil Schmidt
- Subjects
Smith–Waterman algorithm ,Speedup ,Computer Networks and Communications ,Computer science ,Sequence alignment ,Needleman–Wunsch algorithm ,Parallel computing ,DNA sequencing ,Computer Science Applications ,Theoretical Computer Science ,Dynamic programming ,CUDA ,Computational Theory and Mathematics ,Software - Abstract
In this paper, we present GSWABE, a graphics processing unit GPU-accelerated pairwise sequence alignment algorithm for a collection of short DNA sequences. This algorithm supports all-to-all pairwise global, semi-global and local alignment, and retrieves optimal alignments on Compute Unified Device Architecture CUDA-enabled GPUs. All of the three alignment types are based on dynamic programming and share almost the same computational pattern. Thus, we have investigated a general tile-based approach to facilitating fast alignment by deeply exploring the powerful compute capability of CUDA-enabled GPUs. The performance of GSWABE has been evaluated on a Kepler-based Tesla K40 GPU using a variety of short DNA sequence datasets. The results show that our algorithm can yield a performance of up to 59.1 billions cell updates per second GCUPS, 58.5 GCUPS and 50.3 GCUPS for global, semi-global and local alignment, respectively. Furthermore, on the same system GSWABE runs up to 156.0 times faster than the Streaming SIMD Extensions SSE-based SSW library and up to 102.4 times faster than the CUDA-based MSA-CUDA the first stage in terms of local alignment. Compared with the CUDA-based gpu-pairAlign, GSWABE demonstrates stable and consistent speedups with a maximum speedup of 11.2, 10.7, and 10.6 for global, semi-global, and local alignment, respectively. Copyright © 2014 John Wiley & Sons, Ltd.
- Published
- 2014
8. MASA‐OpenCL: Parallel pruned comparison of long DNA sequences with OpenCL
- Author
-
Genaína Nunes Rodrigues, Marco Antonio C. de Figueiredo, Edans Flavius de Oliveira Sandes, Alba Cristina Magalhaes Alves de Melo, and George Teodoro
- Subjects
Smith–Waterman algorithm ,Computational Theory and Mathematics ,Computer Networks and Communications ,Computer science ,Pairwise sequence alignment ,Algorithm ,Software ,DNA sequencing ,Computer Science Applications ,Theoretical Computer Science - Published
- 2018
9. Adaptive Smith-Waterman residue match seeding for protein structural alignment
- Author
-
Mickaël Rouquier, Nathalie Tarrat, Christopher M. Topham, and Isabelle André
- Subjects
Smith–Waterman algorithm ,0303 health sciences ,Multiple sequence alignment ,Computer science ,030302 biochemistry & molecular biology ,Protein domain ,Structural alignment ,Structure space ,Sequence alignment ,Biochemistry ,Dynamic programming ,03 medical and health sciences ,Crystallography ,Structural Biology ,Pairwise comparison ,Molecular Biology ,Algorithm ,030304 developmental biology - Abstract
The POLYFIT rigid-body algorithm for automated global pairwise and multiple protein structural alignment is presented. Smith-Waterman local alignment is used to establish a set of seed equivalences that are extended using Needleman-Wunsch dynamic programming techniques. Structural and functional interaction constraints provided by evolution are encoded as one-dimensional residue physical environment strings for alignment of highly structurally overlapped protein pairs. Local structure alignment of more distantly related pairs is carried out using rigid-body conformational matching of 15-residue fragments, with allowance made for less stringent conformational matching of metal-ion and small molecule ligand-contact, disulphide bridge, and cis-peptide correspondences. Protein structural plasticity is accommodated through the stepped adjustment of a single empirical distance parameter value in the calculation of the Smith-Waterman dynamic programming matrix. Structural overlap is used both as a measure of similarity and to assess alignment quality. Pairwise alignment accuracy has been benchmarked against that of 10 widely used aligners on the Sippl and Wiederstein set of difficult pairwise structure alignment problems, and more extensively against that of Matt, SALIGN, and MUSTANG in pairwise and multiple structural alignments of protein domains with low shared sequence identity in the SCOP-ASTRAL 40% compendium. The results demonstrate the advantages of POLYFIT over other aligners in the efficient and robust identification of matching seed residue positions in distantly related protein targets and in the generation of longer structurally overlapped alignment lengths. Superposition-based application areas include comparative modeling and protein and ligand design. POLYFIT is available on the Web server at http://polyfit.insa-toulouse.fr. Proteins 2013; 81:1823-1839. (c) 2013 Wiley Periodicals, Inc.
- Published
- 2013
10. Optimization schemes and performance evaluation of Smith-Waterman algorithm on CPU, GPU and FPGA
- Author
-
Dan Zou, Fei Xia, and Yong Dou
- Subjects
Smith–Waterman algorithm ,Computer Networks and Communications ,Computer science ,Clock rate ,Graphics processing unit ,Systolic array ,Parallel computing ,CPU shielding ,computer.software_genre ,Computer Science Applications ,Theoretical Computer Science ,Computational Theory and Mathematics ,Shared memory ,Compiler ,Field-programmable gate array ,computer ,Software - Abstract
With fierce competition between CPU and graphics processing unit (GPU) platforms, performance evaluation has become the focus of various sectors. In this paper, we take a well-known algorithm in the field of biosequence matching and database searching, the Smith–Waterman (S-W) algorithm as an example, and demonstrate approaches that fully exploit its performance potentials on CPU, GPU, and field-programmable gate array (FPGA) computing platforms. For CPU platforms, we perform two optimizations, single instruction, multiple data and multithread, with compiler options, to gain over 70 × speedups over naive CPU versions on quad-core CPU platforms. For GPU platforms, we propose the combination of coalesced global memory accesses, shared memory tiles, and loop unfolding, achieving 50 × speedups over initial GPU versions on an NVIDIA GeForce GTX 470 card. Experimental results show that the GPU GTX 470 gains 12 × speedups, instead of 100 × reported by some studies, over Intel quadcore CPU Q9400, under the same manufacturing technology and both with fully optimized schemes. In addition, for FPGA platforms, we customize a linear systolic array for the S-W algorithm in a 45-nm FPGA chip from Xilinx (XC6VLX760), with up to 1024 processing elements. Under only 133 MHz clock rate, the FPGA platform reaches the highest performance and becomes the most power-efficient platform, using only 25 W compared with 190 W of the GPU GTX 470. Copyright © 2011 John Wiley & Sons, Ltd.
- Published
- 2011
11. RSLpred: an integrative system for predicting subcellular localization of rice proteins combining compositional and evolutionary information
- Author
-
Rakesh Kaundal and Gajendra P. S. Raghava
- Subjects
Proteomics ,Cytoplasm ,Sequence analysis ,Nearest neighbor search ,Computational biology ,Biology ,Biochemistry ,Evolution, Molecular ,User-Computer Interface ,Artificial Intelligence ,Sequence Analysis, Protein ,Encoding (memory) ,Computer Simulation ,Amino Acid Sequence ,Databases, Protein ,Molecular Biology ,Peptide sequence ,Plant Proteins ,Organelles ,Smith–Waterman algorithm ,Genetics ,Internet ,Reproducibility of Results ,food and beverages ,Oryza ,Support vector machine ,Benchmark (computing) ,Algorithms - Abstract
The attainment of complete map-based sequence for rice (Oryza sativa) is clearly a major milestone for the research community. Identifying the localization of encoded proteins is the key to understanding their functional characteristics and facilitating their purification. Our proposed method, RSLpred, is an effort in this direction for genome-scale subcellular prediction of encoded rice proteins. First, the support vector machine (SVM)-based modules have been developed using traditional amino acid-, dipeptide- (i+1) and four parts-amino acid composition and achieved an overall accuracy of 81.43, 80.88 and 81.10%, respectively. Secondly, a similarity search-based module has been developed using position-specific iterated-basic local alignment search tool and achieved 68.35% accuracy. Another module developed using evolutionary information of a protein sequence extracted from position-specific scoring matrix achieved an accuracy of 87.10%. In this study, a large number of modules have been developed using various encoding schemes like higher-order dipeptide composition, N- and C-terminal, splitted amino acid composition and the hybrid information. In order to benchmark RSLpred, it was tested on an independent set of rice proteins where it outperformed widely used prediction methods such as TargetP, Wolf-PSORT, PA-SUB, Plant-Ploc and ESLpred. To assist the plant research community, an online web tool 'RSLpred' has been developed for subcellular prediction of query rice proteins, which is freely accessible at http://www.imtech.res.in/raghava/rslpred.
- Published
- 2009
12. Protein structure mining using a structural alphabet
- Author
-
Narayanaswamy Srinivasan, Bernard Offmann, A.G. de Brevern, and Manoj Tyagi
- Subjects
Smith–Waterman algorithm ,0303 health sciences ,Computer science ,030302 biochemistry & molecular biology ,Structural alignment ,Structure mining ,Biochemistry ,Substitution matrix ,03 medical and health sciences ,Crystallography ,Protein structure ,Structural Biology ,Structural motif ,Molecular Biology ,Protein secondary structure ,Algorithm ,030304 developmental biology ,Block (data storage) - Abstract
We present a comprehensive evaluation of a new structure mining method called PB-ALIGN. It is based on the encoding of protein structure as 1D sequence of a combination of 16 short structural motifs or protein blocks (PBs). PBs are short motifs capable of representing most of the local structural features of a protein backbone. Using derived PB substitution matrix and simple dynamic programming algorithm, PB sequences are aligned the same way amino acid sequences to yield structure alignment. PBs are short motifs capable of representing most of the local structural features of a protein backbone. Alignment of these local features as sequence of symbols enables fast detection of structural similarities between two proteins. Ability of the method to characterize and align regions beyond regular secondary structures, for example, N and C caps of helix and loops connecting regular structures, puts it a step ahead of existing methods, which strongly rely on secondary structure elements. PB-ALIGN achieved efficiency of 85% in extracting true fold from a large database of 7259 SCOP domains and was successful in 82% cases to identify true super-family members. On comparison to 13 existing structure comparison/mining methods, PB-ALIGN emerged as the best on general ability test dataset and was at par with methods like YAKUSA and CE on nontrivial test dataset. Furthermore, the proposed method performed well when compared to flexible structure alignment method like FATCAT and outperforms in processing speed (less than 45 s per database scan). This work also establishes a reliable cut-off value for the demarcation of similar folds. It finally shows that global alignment scores of unrelated structures using PBs follow an extreme value distribution. PB-ALIGN is freely available on web server called Protein Block Expert (PBE) at http://bioinformatics.univ-reunion.fr/PBE/.
- Published
- 2007
13. Scoring profile-to-profile sequence alignments
- Author
-
Guoli Wang and Roland L. Dunbrack
- Subjects
Smith–Waterman algorithm ,Multiple sequence alignment ,Sequence database ,Sequence analysis ,Computer science ,business.industry ,Structural alignment ,Sequence alignment ,Pattern recognition ,Bioinformatics ,Sensitivity and Specificity ,Biochemistry ,Article ,Weighting ,Sequence Analysis, Protein ,Artificial intelligence ,business ,Sequence Alignment ,Molecular Biology ,Algorithms ,Alignment-free sequence analysis - Abstract
Sequence alignment profiles have been shown to be very powerful in creating accurate sequence alignments. Profiles are often used to search a sequence database with a local alignment algorithm. More accurate and longer alignments have been obtained with profile-to-profile comparison. There are several steps that must be performed in creating profile–profile alignments, and each involves choices in parameters and algorithms. These steps include (1) what sequences to include in a multiple alignment used to build each profile, (2) how to weight similar sequences in the multiple alignment and how to determine amino acid frequencies from the weighted alignment, (3) how to score a column from one profile aligned to a column of the other profile, (4) how to score gaps in the profile–profile alignment, and (5) how to include structural information. Large-scale benchmarks consisting of pairs of homologous proteins with structurally determined sequence alignments are necessary for evaluating the efficacy of each scoring scheme. With such a benchmark, we have investigated the properties of profile–profile alignments and found that (1) with optimized gap penalties, most column–column scoring functions behave similarly to one another in alignment accuracy; (2) some functions, however, have much higher search sensitivity and specificity; (3) position-specific weighting schemes in determining amino acid counts in columns of multiple sequence alignments are better than sequence-specific schemes; (4) removing positions in the profile with gaps in the query sequence results in better alignments; and (5) adding predicted and known secondary structure information improves alignments.
- Published
- 2004
14. Pairwise Local Alignment and Database Search
- Author
-
Inge Jonassen, William R. Taylor, and Ingvar Eidhammer
- Subjects
Dynamic programming ,Smith–Waterman algorithm ,Theoretical computer science ,Information retrieval ,Computer science ,Pairwise comparison ,Database search engine - Published
- 2003
15. Performance evaluation of a new algorithm for the detection of remote homologs with sequence comparison
- Author
-
Richard A. Goldstein and Maricel G. Kann
- Subjects
Smith–Waterman algorithm ,Sequence Homology, Amino Acid ,Computer science ,Proteins ,Score ,Sequence alignment ,Structural Classification of Proteins database ,Sensitivity and Specificity ,Biochemistry ,Hybrid algorithm ,Dynamic programming ,Set (abstract data type) ,Sequence Analysis, Protein ,Structural Biology ,Hidden Markov model ,Sequence Alignment ,Molecular Biology ,Algorithm ,Algorithms - Abstract
A detailed analysis of the performance of hybrid, a new sequence alignment algorithm developed by Yu and coworkers that combines Smith Waterman local dynamic programming with a local version of the maximum-likelihood approach, was made to access the applicability of this algorithm to the detection of distant homologs by sequence comparison. We analyzed the statistics of hybrid with a set of nonhomologous protein sequences from the SCOP database and found that the statistics of the scores from hybrid algorithm follows an Extreme Value Distribution with lambda ∼1, as previously shown by Yu et al. for the case of artificially generated sequences. Local dynamic programming was compared to the hybrid algorithm by using two different test data sets of distant homologs from the PFAM and COGs protein sequence databases. The studies were made with several score functions in current use including OPTIMA, a new score function originally developed to detect remote homologs with the Smith Waterman algorithm. We found OPTIMA to be the best score function for both both dynamic programming and the hybrid algorithms. The ability of dynamic programming to discriminate between homologs and nonhomologs in the two sets of distantly related sequences is slightly better than that of hybrid algorithm. The advantage of producing accurate score statistics with only a few simulations may overcome the small differences in performance and make this new algorithm suitable for detection of homologs in conjunction with a wide range of score functions and gap penalties. Proteins 2002;48:367–376. © 2002 Wiley-Liss, Inc.
- Published
- 2002
16. A fast algorithm for genome-wide analysis of proteins with repeated sequences
- Author
-
Todd O. Yeates, Matteo Pellegrini, and Edward M. Marcotte
- Subjects
Genetics ,Smith–Waterman algorithm ,Saccharomyces cerevisiae ,Computational biology ,Biology ,biology.organism_classification ,Biochemistry ,Genome ,Yeast ,Tandem repeat ,Structural Biology ,Gene duplication ,Molecular Biology ,Gene ,Sequence (medicine) - Abstract
We present a fast algorithm to search for repeating fragments within protein sequences. The technique is based on an extension of the Smith-Waterman algorithm that allows the calculation of sub-optimal alignments of a sequence against itself. We are able to estimate the statistical significance of all sub-optimal alignment scores. We also rapidly determine the length of the repeating fragment and the number of times it is found in a sequence. The technique is applied to sequences in the Swissprot database, and to 16 complete genomes. We find that eukaryotic proteins contain more internal repeats than those of prokaryotic and archael organisms. The finding that 18% of yeast sequences and 28% of the known human sequences contain detectable repeats emphasizes the importance of internal duplication in protein evolution.
- Published
- 1999
17. Comparison of methods for searching protein sequence databases
- Author
-
William R. Pearson
- Subjects
Normalization (statistics) ,Smith–Waterman algorithm ,Logarithm ,Sequence alignment ,Regression analysis ,Bioinformatics ,Biochemistry ,Matrix (mathematics) ,Protein sequencing ,Simple function ,Hardware_ARITHMETICANDLOGICSTRUCTURES ,Molecular Biology ,Algorithm ,Mathematics - Abstract
We have compared commonly used sequence comparison algorithms, scoring matrices, and gap penalties using a method that identifies statistically significant differences in performance. Search sensitivity with either the Smith-Waterman algorithm or FASTA is significantly improved by using modern scoring matrices, such as BLOSUM45-55, and optimized gap penalties instead of the conventional PAM250 matrix. More dramatic improvement can be obtained by scaling similarity scores by the logarithm of the length of the library sequence (In()-scaling). With the best modern scoring matrix (BLOSUM55 or JO93) and optimal gap penalties (-12 for the first residue in the gap and -2 for additional residues), Smith-Waterman and FASTA performed significantly better than BLASTP. With In()-scaling and optimal scoring matrices (BLOSUM45 or Gonnet92) and gap penalties (-12, -1), the rigorous Smith-Waterman algorithm performs better than either BLASTP and FASTA, although with the Gonnet92 matrix the difference with FASTA was not significant. Ln()-scaling performed better than normalization based on other simple functions of library sequence length. Ln()-scaling also performed better than scores based on normalized variance, but the differences were not statistically significant for the BLOSUM50 and Gonnet92 matrices. Optimal scoring matrices and gap penalties are reported for Smith-Waterman and FASTA, using conventional or In()-scaled similarity scores. Searches with no penalty for gap extension, or no penalty for gap opening, or an infinite penalty for gaps performed significantly worse than the best methods. Differences in performance between FASTA and Smith-Waterman were not significant when partial query sequences were used. However, the best performance with complete query sequences was obtained with the Smith-Waterman algorithm and In()-scaling.
- Published
- 1995
18. Smith–Waterman Algorithm
- Author
-
Richard Mott
- Subjects
Smith–Waterman algorithm ,Genetics ,chemistry.chemical_compound ,Multiple sequence alignment ,chemistry ,Similarity (network science) ,Computer science ,Sequence alignment ,Computational biology ,human activities ,DNA ,Computer algorithm - Abstract
The Smith–Waterman algorithm is a computer algorithm that finds regions of local similarity between DNA or protein sequences. Keywords: DNA; protein; sequence alignment
- Published
- 2005
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.