Back to Search Start Over

Fake speech detection using VGGish with attention block

Authors :
Tahira Kanwal
Rabbia Mahum
Abdul Malik AlSalman
Mohamed Sharaf
Haseeb Hassan
Source :
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2024, Iss 1, Pp 1-19 (2024)
Publication Year :
2024
Publisher :
SpringerOpen, 2024.

Abstract

Abstract While deep learning technologies have made remarkable progress in generating deepfakes, their misuse has become a well-known concern. As a result, the ubiquitous usage of deepfakes for increasing false information poses significant risks to the security and privacy of individuals. The primary objective of audio spoofing detection is to identify audio generated through numerous AI-based techniques. Several techniques for fake audio detection already exist using machine learning algorithms. However, they lack generalization and may not identify all types of AI-synthesized audios such as replay attacks, voice conversion, and text-to-speech (TTS). In this paper, a deep layered model, i.e., VGGish, along with an attention block, namely Convolutional Block Attention Module (CBAM) for spoofing detection, is introduced. Our suggested model successfully classifies input audio into two classes: Fake and Real, converting them into mel-spectrograms, and extracting their most representative features due to the attention block. Our model is a significant technique to utilize for audio spoofing detection due to a simple layered architecture. It captures complex relationships in audio signals due to both spatial and channel features present in an attention module. To evaluate the effectiveness of our model, we have conducted in-depth testing using the ASVspoof 2019 dataset. The proposed technique achieved an EER of 0.52% for Physical Access (PA) attacks and 0.07 % for Logical Access (LA) attacks.

Details

Language :
English
ISSN :
16874722
Volume :
2024
Issue :
1
Database :
Directory of Open Access Journals
Journal :
EURASIP Journal on Audio, Speech, and Music Processing
Publication Type :
Academic Journal
Accession number :
edsdoj.8355bb6e70a47dfb902fa6d10e63f9f
Document Type :
article
Full Text :
https://doi.org/10.1186/s13636-024-00348-4