Back to Search
Start Over
An Analysis of the Effect of Data Augmentation Methods: Experiments for a Musical Genre Classification Task
- Source :
- Transactions of the International Society for Music Information Retrieval, Transactions of the International Society for Music Information Retrieval, Vol 2, Iss 1 (2019)
- Publication Year :
- 2019
- Publisher :
- Ubiquity Press, Ltd., 2019.
-
Abstract
- Supervised machine learning relies on the accessibility of large datasets of annotated data. This is essential since small datasets generally lead to overfitting when training high-dimensional machine-learning models. Since the manual annotation of such large datasets is a long, tedious and expensive process, another possibility is to artificially increase the size of the dataset. This is known as data augmentation. In this paper we provide an in-depth analysis of two data augmentation methods: sound transformations and sound segmentation. The first transforms a music track to a set of new music tracks by applying processes such as pitch-shifting, time-stretching or filtering. The second one splits a long sound signal into a set of shorter time segments. We study the effect of these two techniques (and the parameters of those) for a genre classification task using public datasets. The main contribution of this work is to detail by experimentation the benefit of these methods, used alone or together, during training and/or testing. We also demonstrate their use in improving the robustness of potentially unknown sound degradations. By analyzing these results, good practice recommendations are provided.
- Subjects :
- lcsh:M1-5000
Computer science
02 engineering and technology
Musical
Overfitting
Machine learning
computer.software_genre
Robustness (computer science)
0202 electrical engineering, electronic engineering, information engineering
Segmentation
supervised training
Good practice
Supervised training
lcsh:Music
Audio signal
lcsh:T58.5-58.64
lcsh:Information technology
business.industry
datasets
020206 networking & telecommunications
Manual annotation
Data Augmentation
Datasets
Musical genre classification
020201 artificial intelligence & image processing
Artificial intelligence
musical genre classification
business
computer
data augmentation
Subjects
Details
- ISSN :
- 25143298
- Volume :
- 2
- Database :
- OpenAIRE
- Journal :
- Transactions of the International Society for Music Information Retrieval
- Accession number :
- edsair.doi.dedup.....f2bd13c222f7c962cf54ea607917a8b1