1. Acoustic feature extraction using perceptual wavelet packet decomposition for frog call classification
- Author
-
Bungartz, H J, Kranzlmuller, D, Xie, Jie, Towsey, Michael, Eichinski, Phil, Zhang, Jinglan, Roe, Paul, Bungartz, H J, Kranzlmuller, D, Xie, Jie, Towsey, Michael, Eichinski, Phil, Zhang, Jinglan, and Roe, Paul
- Abstract
Frog protection has become increasingly essential due to the rapid decline of its biodiversity. Therefore, it is valuable to develop new methods for studying this biodiversity. In this paper, a novel feature extraction method is proposed based on perceptual wavelet packet decomposition for classifying frog calls in noisy environments. Pre-processing and syllable segmentation are first applied to the frog call. Then, a spectral peak track is extracted from each syllable if possible. Track duration, dominant frequency and oscillation rate are directly extracted from the track. With k-means clustering algorithm, the calculated dominant frequency of all frog species is clustered into k parts, which produce a frequency scale for wavelet packet decomposition. Based on the adaptive frequency scale, wavelet packet decomposition is applied to the frog calls. Using the wavelet packet decomposition coefficients, a new feature set named perceptual wavelet packet decomposition sub-band cepstral coefficients is extracted. Finally, a k-nearest neighbour (k-NN) classifier is used for the classification. The experiment results show that the proposed features can achieve an average classification accuracy of 97.45% which outperforms syllable features (86.87%) and Mel-frequency cepstral coefficients (MFCCs) feature (90.80%).
- Published
- 2015