Multilingual extension of a temporal expression normalizer using annotated corpora

Item information
Title: Multilingual extension of a temporal expression normalizer using annotated corpora
Authors: Saquete Boró, Estela | Martínez-Barco, Patricio | Muñoz, Rafael | Negri, Matteo | Speranza, Manuela | Sprugnoli, Rachele
Research Group/s: Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
Center, Department or Service: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Keywords: Multilingual extension | TERSEO | Temporal expression normalizer | Annotated corpora
Knowledge Area: Lenguajes y Sistemas Informáticos
Issue Date: 2006
Publisher: Association for Computational Linguistics (ACL)
Citation: SAQUETE, E., et al. "Multilingual extension of a temporal expression normalizer using annotated corpora". In: Cross-Language Knowledge Induction Workshop : April 3, 2006, Trento, Italy. Stroudsburg, PA : ACL, 2006, pp. 1-8
Abstract: This paper presents the automatic extension to other languages of TERSEO, a knowledge-based system for the recognition and normalization of temporal expressions originally developed for Spanish. TERSEO was first extended to English through the automatic translation of the temporal expressions. Then, an improved porting process was applied to Italian, where the automatic translation of the temporal expressions from English and from Spanish was combined with the extraction of new expressions from an Italian annotated corpus. Experimental results demonstrate how, while still adhering to the rule-based paradigm, the development of automatic rule translation procedures allowed us to minimize the effort required for porting to new languages. Relying on such procedures, and without any manual effort or previous knowledge of the target language, TERSEO recognizes and normalizes temporal expressions in Italian with good results (72% precision and 83% recall for recognition).
Sponsor: This research was partially funded by the Spanish Government (contract TIC2003-07158-C04-01)
Language: eng
Type: info:eu-repo/semantics/conferenceObject
Peer Review: yes
Appears in Collections: INV - GPLSI - Comunicaciones a Congresos, Conferencias, etc.

Files in This Item:
File: 2006_Saquete_EACL.pdf
Size: 410.14 kB
Format: Adobe PDF

Items in RUA are protected by copyright, with all rights reserved, unless otherwise indicated.