A benchmark of Spanish language datasets for computationally driven research

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/120141
Información del item - Informació de l'item - Item information
Title: A benchmark of Spanish language datasets for computationally driven research
Authors: Candela, Gustavo | Sáez Fernández, María Dolores | Escobar Esteban, María Pilar | Marco Such, Manuel
Research Group/s: Lucentia | Transducens
Center, Department or Service: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Keywords: Collections as data | Data quality metrics | Digital libraries | GLAM labs
Knowledge Area: Lenguajes y Sistemas Informáticos
Issue Date: 13-Dec-2021
Publisher: SAGE Publications
Citation: Journal of Information Science. 2023, 49(6): 1451-1461. https://doi.org/10.1177/01655515211060530
Abstract: In the domain of Galleries, Libraries, Archives and Museums (GLAM) institutions, creative and innovative tools and methodologies for content delivery and user engagement have recently gained international attention. New methods have been proposed to publish digital collections as datasets amenable to computational use. Standardised benchmarks can be useful to broaden the scope of machine-actionable collections and to promote cultural and linguistic diversity. In this article, we propose a methodology to select datasets for computationally driven research applied to Spanish text corpora. This work seeks to encourage Spanish and Latin American institutions to publish machine-actionable collections based on best practices and avoiding common mistakes.
Sponsor: This research has been funded by the AETHER-UA (PID2020-112540RB-C43) Project from the Spanish Ministry of Science and Innovation.
URI: http://hdl.handle.net/10045/120141
ISSN: 0165-5515 (Print) | 1741-6485 (Online)
DOI: 10.1177/01655515211060530
Language: eng
Type: info:eu-repo/semantics/article
Rights: © The Author(s) 2021
Peer Review: si
Publisher version: https://doi.org/10.1177/01655515211060530
Appears in Collections:INV - TRANSDUCENS - Artículos de Revistas
INV - LUCENTIA - Artículos de Revistas

Files in This Item:
Files in This Item:
File Description SizeFormat 
ThumbnailCandela_etal_2021_JIS_preprint.pdfPreprint (acceso abierto)273,95 kBAdobe PDFOpen Preview
ThumbnailCandela_etal_2021_JIS_final.pdfVersión final (acceso restringido)458,83 kBAdobe PDFOpen    Request a copy


Items in RUA are protected by copyright, with all rights reserved, unless otherwise indicated.