Back to Search Start Over

Generalization error of random feature and kernel methods: Hypercontractivity and kernel matrix concentration.

Authors :
Mei, Song
Misiakiewicz, Theodor
Montanari, Andrea
Source :
Applied & Computational Harmonic Analysis. Jul2022, Vol. 59, p3-84. 82p.
Publication Year :
2022

Abstract

Consider the classical supervised learning problem: we are given data (y i , x i) , i ≤ n , with y i a response and x i ∈ X a covariates vector, and try to learn a model f ˆ : X → R to predict future responses. Random feature methods map the covariates vector x i to a point ϕ (x i) in a higher dimensional space R N , via a random featurization map ϕ . We study the use of random feature methods in conjunction with ridge regression in the feature space R N. This can be viewed as a finite-dimensional approximation of kernel ridge regression (KRR), or as a stylized model for neural networks in the so called lazy training regime. We define a class of problems satisfying certain spectral conditions on the underlying kernels, and a hypercontractivity assumption on the associated eigenfunctions. These conditions are verified by classical high-dimensional examples. Under these conditions, we prove a sharp characterization of the error of random feature ridge regression. In particular, we address two fundamental questions: (1) What is the generalization error of KRR? (2) How big N should be for the random feature approximation to achieve the same error as KRR? In this setting, we prove that KRR is well approximated by a projection onto the top ℓ eigenfunctions of the kernel, where ℓ depends on the sample size n. We show that the test error of random feature ridge regression is dominated by its approximation error and is larger than the error of KRR as long as N ≤ n 1 − δ for some δ > 0. We characterize this gap. For N ≥ n 1 + δ , random features achieve the same error as the corresponding KRR, and further increasing N does not lead to a significant change in test error. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10635203
Volume :
59
Database :
Academic Search Index
Journal :
Applied & Computational Harmonic Analysis
Publication Type :
Academic Journal
Accession number :
156503103
Full Text :
https://doi.org/10.1016/j.acha.2021.12.003