Spelling Normalization of Historical Documents by Using a Machine Translation Approach

Empreu sempre aquest identificador per citar o enllaçar aquest ítem http://hdl.handle.net/10045/76035
Información del item - Informació de l'item - Item information
Títol: Spelling Normalization of Historical Documents by Using a Machine Translation Approach
Autors: Domingo, Miguel | Casacuberta, Francisco
Paraules clau: Machine Translation
Àrees de coneixement: Lenguajes y Sistemas Informáticos
Data de publicació: 2018
Editor: European Association for Machine Translation
Citació bibliogràfica: Domingo, Miguel; Casacuberta, Francisco. “Spelling Normalization of Historical Documents by Using a Machine Translation Approach”. In: Pérez-Ortiz, Juan Antonio, et al. (Eds.). Proceedings of the 21st Annual Conference of the European Association for Machine Translation: 28-30 May 2018, Universitat d'Alacant, Alacant, Spain, pp. 129-137
Resum: The lack of a spelling convention in historical documents makes their orthography to change depending on the author and the time period in which each document was written. This represents a problem for the preservation of the cultural heritage, which strives to create a digital text version of a historical document. With the aim of solving this problem, we propose three approaches—based on statistical, neural and character-based machine translation— to adapt the document’s spelling to modern standards. We tested these approaches in different scenarios, obtaining very encouraging results.
Patrocinadors: The research leading to these results has received funding from the Ministerio de Economía y Competitividad (MINECO) under project CoMUN-HaT (grant agreement TIN2015-70924-C2-1-R), and Generalitat Valenciana (grant agreement PROMETEO/2018/004).
URI: http://hdl.handle.net/10045/76035
ISBN: 978-84-09-01901-4
Idioma: eng
Tipus: info:eu-repo/semantics/conferenceObject
Drets: © 2018 The authors. This article is licensed under a Creative Commons 3.0 licence, no derivative works, attribution, CC-BY-ND.
Revisió científica: si
Versió de l'editor: http://eamt2018.dlsi.ua.es/proceedings-eamt2018.pdf
Apareix a la col·lecció: EAMT2018 - Proceedings

Arxius per aquest ítem:
Arxius per aquest ítem:
Arxiu Descripció Tamany Format  
ThumbnailEAMT2018-Proceedings_15.pdf1,47 MBAdobe PDFObrir Vista prèvia


Aquest ítem està subjecte a una llicència de Creative Commons Llicència Creative Commons Creative Commons