Back to Search
Start Over
Fake speech detection using VGGish with attention block
- Source :
- EURASIP Journal on Audio, Speech, and Music Processing, Vol 2024, Iss 1, Pp 1-19 (2024)
- Publication Year :
- 2024
- Publisher :
- SpringerOpen, 2024.
-
Abstract
- Abstract While deep learning technologies have made remarkable progress in generating deepfakes, their misuse has become a well-known concern. As a result, the ubiquitous usage of deepfakes for increasing false information poses significant risks to the security and privacy of individuals. The primary objective of audio spoofing detection is to identify audio generated through numerous AI-based techniques. Several techniques for fake audio detection already exist using machine learning algorithms. However, they lack generalization and may not identify all types of AI-synthesized audios such as replay attacks, voice conversion, and text-to-speech (TTS). In this paper, a deep layered model, i.e., VGGish, along with an attention block, namely Convolutional Block Attention Module (CBAM) for spoofing detection, is introduced. Our suggested model successfully classifies input audio into two classes: Fake and Real, converting them into mel-spectrograms, and extracting their most representative features due to the attention block. Our model is a significant technique to utilize for audio spoofing detection due to a simple layered architecture. It captures complex relationships in audio signals due to both spatial and channel features present in an attention module. To evaluate the effectiveness of our model, we have conducted in-depth testing using the ASVspoof 2019 dataset. The proposed technique achieved an EER of 0.52% for Physical Access (PA) attacks and 0.07 % for Logical Access (LA) attacks.
Details
- Language :
- English
- ISSN :
- 16874722
- Volume :
- 2024
- Issue :
- 1
- Database :
- Directory of Open Access Journals
- Journal :
- EURASIP Journal on Audio, Speech, and Music Processing
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.8355bb6e70a47dfb902fa6d10e63f9f
- Document Type :
- article
- Full Text :
- https://doi.org/10.1186/s13636-024-00348-4