Back to Search
Start Over
Distributed adaptive Huber regression
- Source :
- Computational Statistics & Data Analysis. 169:107419
- Publication Year :
- 2022
- Publisher :
- Elsevier BV, 2022.
-
Abstract
- Distributed data naturally arise in scenarios involving multiple sources of observations, each stored at a different location. Directly pooling all the data together is often prohibited due to limited bandwidth and storage, or due to privacy protocols. This paper introduces a new robust distributed algorithm for fitting linear regressions when data are subject to heavy-tailed and/or asymmetric errors with finite second moments. The algorithm only communicates gradient information at each iteration and therefore is communication-efficient. Statistically, the resulting estimator achieves the centralized nonasymptotic error bound as if all the data were pooled together and came from a distribution with sub-Gaussian tails. Under a finite $(2+\delta)$-th moment condition, we derive a Berry-Esseen bound for the distributed estimator, based on which we construct robust confidence intervals. Numerical studies further confirm that compared with extant distributed methods, the proposed methods achieve near-optimal accuracy with low variability and better coverage with tighter confidence width.<br />Comment: 29 pages
- Subjects :
- FOS: Computer and information sciences
Statistics and Probability
Adaptive Huber regression
Nonasymptotic analysis
Statistics & Probability
Applied Mathematics
Statistics
Computation Theory and Mathematics
Distributed inference
Methodology (stat.ME)
Computational Mathematics
Computational Theory and Mathematics
Heavy-tailed distribution
Econometrics
Communication efficiency
Statistics - Methodology
Subjects
Details
- ISSN :
- 01679473
- Volume :
- 169
- Database :
- OpenAIRE
- Journal :
- Computational Statistics & Data Analysis
- Accession number :
- edsair.doi.dedup.....a78dea16bb355691fa3b17631aea2257