Recognition of Japanese handwritten characters with Machine learning techniques

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/109318
Información del item - Informació de l'item - Item information
Title: Recognition of Japanese handwritten characters with Machine learning techniques
Authors: Tomás Pérez, José Vicente
Research Director: Iñesta, José M.
Center, Department or Service: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Keywords: Machine Learning | Deep Learning | Python | Flask | OpenCV | Japanese | Datasets | OCR | Recognition | Computer Vision | Canvas | Web | Crowd-sourcing | Keras | Tensorflow | Neural Networks | Convolutional Neural Networks | Reconocimiento | Japonés | Redes Neuronales | Redes Neuronales Convolucionales | CNN | ANN | WSGI
Knowledge Area: Lenguajes y Sistemas Informáticos
Issue Date: 18-Sep-2020
Date of defense: 11-Sep-2020
Abstract: The recognition of Japanese handwritten characters has always been a challenge for researchers. A large number of classes, their graphic complexity, and the existence of three different writing systems make this problem particularly difficult compared to Western writing. For decades, attempts have been made to address the problem using traditional OCR (Optical Character Recognition) techniques, with mixed results. With the recent popularization of machine learning techniques through neural networks, this research has been revitalized, bringing new approaches to the problem. These new results achieve performance levels comparable to human recognition. Furthermore, these new techniques have allowed collaboration with very different disciplines, such as the Humanities or East Asian studies, achieving advances in them that would not have been possible without this interdisciplinary work. In this thesis, these techniques are explored until reaching a sufficient level of understanding that allows us to carry out our own experiments, training neural network models with public datasets of Japanese characters. However, the scarcity of public datasets makes the task of researchers remarkably difficult. Our proposal to minimize this problem is the development of a web application that allows researchers to easily collect samples of Japanese characters through the collaboration of any user. Once the application is fully operational, the examples collected until that point will be used to create a new dataset in a specific format. Finally, we can use the new data to carry out comparative experiments with the previous neural network models.
URI: http://hdl.handle.net/10045/109318
Language: eng
Type: info:eu-repo/semantics/bachelorThesis
Rights: Licencia Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0
Appears in Collections:Grado en Ingeniería Multimedia - Trabajos Fin de Grado

Files in This Item:


Items in RUA are protected by copyright, with all rights reserved, unless otherwise indicated.