A Primer on the Signature Method in Machine Learning

Authors :
Chevyrev, Ilya
Kormilitzin, Andrey
Publication Year :
2016

Abstract

We provide an introduction to the signature method, focusing on its theoretical properties and machine learning applications. Our presentation is divided into two parts. In the first part, we present the definition and fundamental properties of the signature of a path. The signature is a sequence of numbers associated with a path that captures many of its important analytic and geometric properties. As a sequence of numbers, the signature serves as a compact description (dimension reduction) of a path. In presenting its theoretical properties, we assume only familiarity with classical real analysis and integration, and supplement theory with straightforward examples. We also mention several advanced topics, including the role of the signature in rough path theory. In the second part, we present practical applications of the signature to the area of machine learning. The signature method is a non-parametric way of transforming data into a set of features that can be used in machine learning tasks. In this method, data are converted into multi-dimensional paths by means of embedding algorithms, and the signature of each path is then computed. We describe this pipeline in detail, making a link with the properties of the signature presented in the first part. We furthermore review some of the developments of the signature method in machine learning and, as an illustrative example, present a detailed application of the method to handwritten digit classification.

Comment: 61 pages, 26 figures, 3 tables. Expanded Part 1 and simplified the presentation in Part 2. To appear in Open Access in a forthcoming Springer volume "Signatures Methods in Finance: An Introduction with Computational Applications"
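The pipeline summarized in the abstract (embed data as a path, then compute its signature) can be illustrated with a minimal sketch. It is not taken from the paper: the function names `time_embed` and `signature_level2` are hypothetical, the time embedding is only one simple choice of embedding, and the signature is truncated at level 2 and computed for a piecewise linear path using Chen's identity.

```python
import numpy as np

def time_embed(values):
    """Hypothetical embedding: pair each sample with a time stamp in [0, 1]."""
    values = np.asarray(values, dtype=float)
    t = np.linspace(0.0, 1.0, len(values))
    return np.column_stack([t, values])  # shape (n_points, 2)

def signature_level2(path):
    """Signature terms of levels 1 and 2 for a piecewise linear path.

    path: array of shape (n_points, d).
    Returns (s1, s2) with s1 of shape (d,) and s2 of shape (d, d).
    """
    path = np.asarray(path, dtype=float)
    d = path.shape[1]
    s1 = np.zeros(d)
    s2 = np.zeros((d, d))
    for delta in np.diff(path, axis=0):
        # Chen's identity for appending one linear segment with increment delta:
        # level 2 gains (old level 1) x delta plus the segment's own term delta x delta / 2.
        s2 += np.outer(s1, delta) + np.outer(delta, delta) / 2.0
        s1 += delta
    return s1, s2

# An L-shaped path in the plane: right by 1, then up by 1.
s1, s2 = signature_level2([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0]])
levy_area = 0.5 * (s2[0, 1] - s2[1, 0])

# Feature vector for a scalar series, assuming the time embedding above.
f1, f2 = signature_level2(time_embed([1.0, 3.0, 2.0, 5.0]))
features = np.concatenate([f1, f2.ravel()])
```

In this sketch the level-1 terms are the total increments of the path, and the antisymmetric part of the level-2 terms gives the Lévy area, one of the geometric quantities the signature encodes. The flattened terms form a fixed-length feature vector whatever the number of samples in the series.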

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.1603.03788
Document Type :
Working Paper