Accurate Multilevel Classification for Wildlife Images

Please use this identifier to cite or link to this item:
Información del item - Informació de l'item - Item information
Title: Accurate Multilevel Classification for Wildlife Images
Authors: Gomez-Donoso, Francisco | Escalona, Félix | Pérez Esteve, Ferran | Cazorla, Miguel
Research Group/s: Robótica y Visión Tridimensional (RoViT)
Center, Department or Service: Universidad de Alicante. Departamento de Ciencia de la Computación e Inteligencia Artificial | Universidad de Alicante. Instituto Universitario de Investigación Informática
Keywords: Multilevel classification | Wild animals | Plant species | Wildlife images | Accuracy
Knowledge Area: Ciencia de la Computación e Inteligencia Artificial
Issue Date: 2-Apr-2021
Publisher: Hindawi
Citation: Computational Intelligence and Neuroscience. Volume 2021, Article ID 6690590, 11 pages.
Abstract: The most common approaches for classification rely on the inference of a specific class. However, every category could be naturally organized within a taxonomic tree, from the most general concept to the specific element, and that is how human knowledge works. This representation avoids the necessity of learning roughly the same features for a range of very similar categories, and it is easier to understand and work with and provides a classification for each abstraction level. In this paper, we carry out an exhaustive study of different methods to perform multilevel classification applied to the task of classifying wild animals and plant species. Different convolutional backbones, data setups, and ensembling techniques are explored to find the model which provides the best performance. As our experimentation remarks, in order to achieve the best performance on the datasets that are arranged in a tree-like structure, the classifier must feature an EfficientNetB5 backbone with an input size of px, followed by a multilevel classifier. In addition, a Multiscale Crop data augmentation process must be carried out. Finally, the accuracy of this setup is a 62% top-1 accuracy and 88% top-5 accuracy. The architecture could benefit for an accuracy boost if it is involved in an ensemble of cascade classifiers, but the computational demand is unbearable for any real application.
Sponsor: This work was funded by the Spanish Government PID2019-104818RB-I00 grant, supported with FEDER funds. It was supported by Spanish grants for PhD studies ACIF/2017/243 and FPU16/00887.
ISSN: 1687-5265 (Print) | 1687-5273 (Online)
DOI: 10.1155/2021/6690590
Language: eng
Type: info:eu-repo/semantics/article
Rights: © 2021 Francisco Gomez-Donoso et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Peer Review: si
Publisher version:
Appears in Collections:INV - RoViT - Artículos de Revistas

Files in This Item:
Files in This Item:
File Description SizeFormat 
ThumbnailGomez-Donoso_etal_2021_ComputIntelligNeurosci.pdf4,85 MBAdobe PDFOpen Preview

This item is licensed under a Creative Commons License Creative Commons