COMPENDIUM: A text summarization system for generating abstracts of research papers
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10045/33138
Title: | COMPENDIUM: A text summarization system for generating abstracts of research papers |
---|---|
Authors: | Lloret, Elena | Romá-Ferri, María Teresa | Palomar, Manuel |
Research Group/s: | Procesamiento del Lenguaje y Sistemas de Información (GPLSI) |
Center, Department or Service: | Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos | Universidad de Alicante. Departamento de Enfermería |
Keywords: | Human language technologies | NLP applications | Text summarization | Information systems |
Knowledge Area: | Lenguajes y Sistemas Informáticos |
Issue Date: | 14-Aug-2013 |
Publisher: | Elsevier |
Citation: | LLORET, Elena; ROMÁ-FERRI, María Teresa; PALOMAR, Manuel. "COMPENDIUM: A text summarization system for generating abstracts of research papers". Data & Knowledge Engineering. Article In Press (Available online 14 August 2013). ISSN 0169-023X |
Abstract: | This article analyzes the appropriateness of a text summarization system, COMPENDIUM, for generating abstracts of biomedical papers. Two approaches are suggested: an extractive one (COMPENDIUM E), which only selects and extracts the most relevant sentences of the documents, and an abstractive-oriented one (COMPENDIUM E–A), which also takes on the challenge of abstractive summarization. This novel strategy combines extractive information with pieces of information from the article that have previously been compressed or fused. Specifically, in this article, we study: i) whether COMPENDIUM produces good summaries in the biomedical domain; ii) which summarization approach is more suitable; and iii) the opinion of real users towards automatic summaries. To that end, two types of evaluation were performed, quantitative and qualitative, to assess both the information contained in the summaries and user satisfaction. Results show that extractive and abstractive-oriented summaries perform similarly in terms of the information they contain, so both approaches retain the relevant information of the source documents, but the latter is more appropriate from a human perspective when user satisfaction is assessed. This also confirms the suitability of our suggested approach for generating summaries following an abstractive-oriented paradigm. |
Sponsor: | This research was partially supported by the FPI grant (BES-2007-16268) and the project grants TEXT-MESS (TIN2006-15265-C06-01), TEXT-MESS 2.0 (TIN2009-13391-C04) and LEGOLANG (TIN2012-31224) from the Spanish Government. It has also been funded by the Valencian Government (grants no. PROMETEO/2009/119 and ACOMP/2011/001). |
URI: | http://hdl.handle.net/10045/33138 |
ISSN: | 0169-023X (Print) | 1872-6933 (Online) |
DOI: | 10.1016/j.datak.2013.08.005 |
Language: | eng |
Type: | info:eu-repo/semantics/article |
Peer Review: | yes |
Publisher version: | http://dx.doi.org/10.1016/j.datak.2013.08.005 |
Appears in Collections: | INV - GPLSI - Artículos de Revistas |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
1-s2.0-S0169023X13000815-main.pdf | Final version (restricted access) | 1.07 MB | Adobe PDF |
Items in RUA are protected by copyright, with all rights reserved, unless otherwise indicated.