Semi-supervised 3D object recognition through CNN labeling

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/74443
Información del item - Informació de l'item - Item information
Título: Semi-supervised 3D object recognition through CNN labeling
Autor/es: Rangel, José Carlos | Martínez-Gómez, Jesús | Romero-González, Cristina | García-Varea, Ismael | Cazorla, Miguel
Grupo/s de investigación o GITE: Robótica y Visión Tridimensional (RoViT)
Centro, Departamento o Servicio: Universidad de Alicante. Departamento de Ciencia de la Computación e Inteligencia Artificial | Universidad de Alicante. Instituto Universitario de Investigación Informática
Palabras clave: Object recognition | Deep learning | Object labeling | Machine learning
Área/s de conocimiento: Ciencia de la Computación e Inteligencia Artificial
Fecha de publicación: abr-2018
Editor: Elsevier
Cita bibliográfica: Applied Soft Computing. 2018, 65: 603-613. doi:10.1016/j.asoc.2018.02.005
Resumen: Despite the outstanding results of Convolutional Neural Networks (CNNs) in object recognition and classification, there are still some open problems to address when applying these solutions to real-world problems. Specifically, CNNs struggle to generalize under challenging scenarios, like recognizing the variability and heterogeneity of the instances of elements belonging to the same category. Some of these difficulties are directly related to the input information, 2D-based methods still show a lack of robustness against strong lighting variations, for example. In this paper, we propose to merge techniques using both 2D and 3D information to overcome these problems. Specifically, we take advantage of the spatial information in the 3D data to segment objects in the image and build an object classifier, and the classification capabilities of CNNs to semi-supervisedly label each object image for training. As the experimental results demonstrate, our model can successfully generalize for categories with high intra-class variability and outperform the accuracy of a well-known CNN model.
Patrocinador/es: This work has been partially sponsored by the Spanish Ministry of Economy and Competitiveness under Grant Number TIN2015-65686-C5-3-R. It has been also supported by the Spanish Government TIN2016-76515-R Grant, supported with Feder funds. Cristina Romero-González is funded by the MECD Grant FPU12/04387. José Carlos Rangel is funded by the IFARHU Grant 8-2014-166 of the Republic of Panamá.
URI: http://hdl.handle.net/10045/74443
ISSN: 1568-4946 (Print) | 1872-9681 (Online)
DOI: 10.1016/j.asoc.2018.02.005
Idioma: eng
Tipo: info:eu-repo/semantics/article
Derechos: © 2018 Elsevier B.V.
Revisión científica: si
Versión del editor: https://doi.org/10.1016/j.asoc.2018.02.005
Aparece en las colecciones:INV - RoViT - Artículos de Revistas

Archivos en este ítem:
Archivos en este ítem:
Archivo Descripción TamañoFormato 
Thumbnail2018_Rangel_etal_ApplSoftComp_final.pdfVersión final (acceso restringido)3,91 MBAdobe PDFAbrir    Solicitar una copia
Thumbnail2018_Rangel_etal_ApplSoftComp_accepted.pdfAccepted Manuscript (acceso abierto)5,16 MBAdobe PDFAbrir Vista previa


Todos los documentos en RUA están protegidos por derechos de autor. Algunos derechos reservados.