Contributions to 3D object recognition and 3D hand pose estimation using deep learning techniques

Gomez-Donoso, Francisco

Contributions to 3D object recognition and 3D hand pose estimation using deep learning techniques

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/110658

Información del item - Informació de l'item - Item information
Título:	Contributions to 3D object recognition and 3D hand pose estimation using deep learning techniques
Autor/es:	Gomez-Donoso, Francisco
Director de la investigación:	Cazorla, Miguel
Centro, Departamento o Servicio:	Universidad de Alicante. Instituto Universitario de Investigación Informática
Palabras clave:	3D object recognition \| 2D hand pose estimation \| Deep learning \| Machine learning
Área/s de conocimiento:	Ciencia de la Computación e Inteligencia Artificial
Fecha de creación:	2020
Fecha de publicación:	2020
Fecha de lectura:	18-sep-2020
Editor:	Universidad de Alicante
Resumen:	In this thesis, a study of two blooming fields in the artificial intelligence topic is carried out. The first part of the present document is about 3D object recognition methods. Object recognition in general is about providing the ability to understand what objects appears in the input data of an intelligent system. Any robot, from industrial robots to social robots, could benefit of such capability to improve its performance and carry out high level tasks. In fact, this topic has been largely studied and some object recognition methods present in the state of the art outperform humans in terms of accuracy. Nonetheless, these methods are image-based, namely, they focus in recognizing visual features. This could be a problem in some contexts as there exist objects that look alike some other, different objects. For instance, a social robot that recognizes a face in a picture, or an intelligent car that recognizes a pedestrian in a billboard. A potential solution for this issue would be involving tridimensional data so that the systems would not focus on visual features but topological features. Thus, in this thesis, a study of 3D object recognition methods is carried out. The approaches proposed in this document, which take advantage of deep learning methods, take as an input point clouds and are able to provide the correct category. We evaluated the proposals with a range of public challenges, datasets and real life data with high success. The second part of the thesis is about hand pose estimation. This is also an interesting topic that focuses in providing the hand's kinematics. A range of systems, from human computer interaction and virtual reality to social robots could benefit of such capability. For instance to interface a computer and control it with seamless hand gestures or to interact with a social robot that is able to understand human non-verbal communication methods. Thus, in the present document, hand pose estimation approaches are proposed. It is worth noting that the proposals take as an input color images and are able to provide 2D and 3D hand pose in the image plane and euclidean coordinate frames. Specifically, the hand poses are encoded in a collection of points that represents the joints in a hand, so that they can be easily reconstructed in the full hand pose. The methods are evaluated on custom and public datasets, and integrated with a robotic hand teleoperation application with great success.
URI:	http://hdl.handle.net/10045/110658
Idioma:	eng
Tipo:	info:eu-repo/semantics/doctoralThesis
Derechos:	Licencia Creative Commons Reconocimiento-NoComercial-CompartirIgual 4.0
Aparece en las colecciones:	Tesis doctorales

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
tesis_francisco_rafael_gomez_donoso.pdf		38,8 MB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro completo