A trigram part-of-speech tagger for the Apertium free/open-source machine translation platform

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/12032
Full metadata record
DC Field | Value | Language
dc.contributor | Transducens | en
dc.contributor.author | Sheikh, Zaid Md Abdul Wahab | -
dc.contributor.author | Sánchez-Martínez, Felipe | -
dc.contributor.other | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos | en
dc.date.accessioned | 2009-10-27T11:57:22Z | -
dc.date.available | 2009-10-27T11:57:22Z | -
dc.date.issued | 2009-11 | -
dc.identifier.citation | SHEIKH, Zaid Md Abdul Wahab; SÁNCHEZ-MARTÍNEZ, Felipe. "A trigram part-of-speech tagger for the Apertium free/open-source machine translation platform". In: Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation / Edited by Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M. Tyers. Alicante : Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, 2009, pp. 67-74 | en
dc.identifier.uri | http://hdl.handle.net/10045/12032 | -
dc.description.abstract | This paper describes the implementation of a second-order hidden Markov model (HMM) based part-of-speech tagger for the Apertium free/open-source rule-based machine translation platform. We describe the part-of-speech (PoS) tagging approach in Apertium and how it is parametrised through a tagger definition file that defines: (1) the set of tags to be used and (2) constraint rules that can be used to forbid certain PoS tag sequences, thus refining the HMM parameters and increasing its tagging accuracy. The paper also reviews the Baum-Welch algorithm used to estimate the HMM parameters and compares the tagging accuracy achieved with that achieved by the original, first-order HMM-based PoS tagger in Apertium. | en
dc.description.sponsorship | Google Summer of Code 2009 program, and the Spanish Ministry of Science and Innovation under project TIN2009-14009-C02-01. | en
dc.language | eng | en
dc.publisher | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos | en
dc.subject | Hidden Markov Model | en
dc.subject | Part-of-speech tagger | en
dc.subject | Machine translation | en
dc.subject | Apertium | en
dc.subject.other | Lenguajes y Sistemas Informáticos | en
dc.title | A trigram part-of-speech tagger for the Apertium free/open-source machine translation platform | en
dc.type | info:eu-repo/semantics/article | en
dc.peerreviewed | si | en
dc.rights.accessRights | info:eu-repo/semantics/openAccess | -
Appears in collections: Freerbmt09 - Ponencias
INV - TRANSDUCENS - Comunicaciones a Congresos, Conferencias, etc.

Files in this item:
File | Description | Size | Format
paper9.pdf | | 239.77 kB | Adobe PDF


All documents in RUA are protected by copyright. Some rights reserved.