Back to Search Start Over

Length-weighted string kernels for sequence data classification

Authors :
Tian, Shengfeng
Mu, Shaomin
Yin, Chuanhuan
Source :
Pattern Recognition Letters. Oct2007, Vol. 28 Issue 13, p1651-1656. 6p.
Publication Year :
2007

Abstract

Abstract: Various sequence-similarity kernels, the string kernels, have been introduced for use with support vector machines (SVMs) in a discriminative approach to the sequence data classification problems. In these applications, string kernels are asked to be similarity measures between strings. In this paper, we present a new string kernel and its variants suitable to sequence data classification, which are determined by (possibly non-contiguous) matching subsequences with all possible lengths shared by two strings. In these kernels, gaps in subsequences are allowed and the longer subsequences contribute more to the value of kernels. Efficient algorithms of computing the kernels are derived with the techniques of dynamic programming and bit-parallelism. In some cases, the computation of the kernel is linear in the length of the strings. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
01678655
Volume :
28
Issue :
13
Database :
Academic Search Index
Journal :
Pattern Recognition Letters
Publication Type :
Academic Journal
Accession number :
25825881
Full Text :
https://doi.org/10.1016/j.patrec.2007.04.008