Spelling Normalization of Historical Documents by Using a Machine Translation Approach
Empreu sempre aquest identificador per citar o enllaçar aquest ítem
http://hdl.handle.net/10045/76035
Títol: | Spelling Normalization of Historical Documents by Using a Machine Translation Approach |
---|---|
Autors: | Domingo, Miguel | Casacuberta, Francisco |
Paraules clau: | Machine Translation |
Àrees de coneixement: | Lenguajes y Sistemas Informáticos |
Data de publicació: | 2018 |
Editor: | European Association for Machine Translation |
Citació bibliogràfica: | Domingo, Miguel; Casacuberta, Francisco. “Spelling Normalization of Historical Documents by Using a Machine Translation Approach”. In: Pérez-Ortiz, Juan Antonio, et al. (Eds.). Proceedings of the 21st Annual Conference of the European Association for Machine Translation: 28-30 May 2018, Universitat d'Alacant, Alacant, Spain, pp. 129-137 |
Resum: | The lack of a spelling convention in historical documents makes their orthography to change depending on the author and the time period in which each document was written. This represents a problem for the preservation of the cultural heritage, which strives to create a digital text version of a historical document. With the aim of solving this problem, we propose three approaches—based on statistical, neural and character-based machine translation— to adapt the document’s spelling to modern standards. We tested these approaches in different scenarios, obtaining very encouraging results. |
Patrocinadors: | The research leading to these results has received funding from the Ministerio de Economía y Competitividad (MINECO) under project CoMUN-HaT (grant agreement TIN2015-70924-C2-1-R), and Generalitat Valenciana (grant agreement PROMETEO/2018/004). |
URI: | http://hdl.handle.net/10045/76035 |
ISBN: | 978-84-09-01901-4 |
Idioma: | eng |
Tipus: | info:eu-repo/semantics/conferenceObject |
Drets: | © 2018 The authors. This article is licensed under a Creative Commons 3.0 licence, no derivative works, attribution, CC-BY-ND. |
Revisió científica: | si |
Versió de l'editor: | http://eamt2018.dlsi.ua.es/proceedings-eamt2018.pdf |
Apareix a la col·lecció: | EAMT2018 - Proceedings |
Arxius per aquest ítem:
Arxiu | Descripció | Tamany | Format | |
---|---|---|---|---|
EAMT2018-Proceedings_15.pdf | 1,47 MB | Adobe PDF | Obrir Vista prèvia | |
Aquest ítem està subjecte a una llicència de Creative Commons Llicència Creative Commons