Back to Search Start Over

Stochastic Subsampling for Factorizing Huge Matrices.

Authors :
Mensch, Arthur
Mairal, Julien
Thirion, Bertrand
Varoquaux, Gael
Source :
IEEE Transactions on Signal Processing; Jan2018, Vol. 66 Issue 1, p113-128, 16p
Publication Year :
2018

Abstract

We present a matrix-factorization algorithm that scales to input matrices with both huge number of rows and columns. Learned factors may be sparse or dense and/or nonnegative, which makes our algorithm suitable for dictionary learning, sparse component analysis, and nonnegative matrix factorization. Our algorithm streams matrix columns while subsampling them to iteratively learn the matrix factors. At each iteration, the row dimension of a new sample is reduced by subsampling, resulting in lower time complexity compared to a simple streaming algorithm. Our method comes with convergence guarantees to reach a stationary point of the matrix-factorization problem. We demonstrate its efficiency on massive functional magnetic resonance imaging data (2 TB), and on patches extracted from hyperspectral images (103 GB). For both problems, which involve different penalties on rows and columns, we obtain significant speed-ups compared to state-of-the-art algorithms. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1053587X
Volume :
66
Issue :
1
Database :
Complementary Index
Journal :
IEEE Transactions on Signal Processing
Publication Type :
Academic Journal
Accession number :
127950165
Full Text :
https://doi.org/10.1109/TSP.2017.2752697