1. Accurate Multilevel Classification for Wildlife Images
- Author
-
Francisco Gomez-Donoso, Felix Escalona, Ferran Pérez-Esteve, Miguel Cazorla, Universidad de Alicante. Departamento de Ciencia de la Computación e Inteligencia Artificial, Universidad de Alicante. Instituto Universitario de Investigación Informática, and Robótica y Visión Tridimensional (RoViT)
- Subjects
Article Subject ,General Computer Science ,Process (engineering) ,Computer science ,General Mathematics ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Inference ,Animals, Wild ,Neurosciences. Biological psychiatry. Neuropsychiatry ,02 engineering and technology ,Machine learning ,computer.software_genre ,Wild animals ,Abstraction layer ,0502 economics and business ,0202 electrical engineering, electronic engineering, information engineering ,Feature (machine learning) ,Animals ,Humans ,Representation (mathematics) ,Accuracy ,business.industry ,General Neuroscience ,05 social sciences ,Ciencia de la Computación e Inteligencia Artificial ,General Medicine ,Class (biology) ,Tree (data structure) ,Wildlife images ,Multilevel classification ,Plant species ,050211 marketing ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Classifier (UML) ,computer ,Research Article ,RC321-571 - Abstract
The most common approaches for classification rely on the inference of a specific class. However, every category could be naturally organized within a taxonomic tree, from the most general concept to the specific element, and that is how human knowledge works. This representation avoids the necessity of learning roughly the same features for a range of very similar categories, and it is easier to understand and work with and provides a classification for each abstraction level. In this paper, we carry out an exhaustive study of different methods to perform multilevel classification applied to the task of classifying wild animals and plant species. Different convolutional backbones, data setups, and ensembling techniques are explored to find the model which provides the best performance. As our experimentation remarks, in order to achieve the best performance on the datasets that are arranged in a tree-like structure, the classifier must feature an EfficientNetB5 backbone with an input size of 300 × 300 px, followed by a multilevel classifier. In addition, a Multiscale Crop data augmentation process must be carried out. Finally, the accuracy of this setup is a 62% top-1 accuracy and 88% top-5 accuracy. The architecture could benefit for an accuracy boost if it is involved in an ensemble of cascade classifiers, but the computational demand is unbearable for any real application.
- Published
- 2021
- Full Text
- View/download PDF