Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds

Authors :
Bae, Hanbin
Andreev, Pavel
Saginbaev, Azat
Babaev, Nicholas
Lee, Won-Jun
Sung, Hosang
Cho, Hoon-Young
Publication Year :
2024

Abstract

This paper introduces a speech enhancement solution tailored for on-device use in true wireless stereo (TWS) earbuds. The solution is designed to support conversations in noisy environments while active noise cancellation (ANC) is enabled. The primary challenges for speech enhancement models in this setting are computational complexity, which limits on-device use, and latency, which must stay below 3 ms to preserve a live conversation. To address these issues, we evaluated several crucial design elements, including the network architecture and domain, the design of loss functions, the pruning method, and hardware-specific optimization. As a result, we demonstrated substantial improvements in speech enhancement quality over baseline models while simultaneously reducing computational complexity and algorithmic latency.

Comment: Accepted by Interspeech 2024

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2409.18705
Document Type :
Working Paper
Full Text :
https://doi.org/10.21437/Interspeech.2024-1444