KD SENSO-MERGER: An architecture for semantic integration of heterogeneous data

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/140267
Información del item - Informació de l'item - Item information
Título: KD SENSO-MERGER: An architecture for semantic integration of heterogeneous data
Autor/es: Gutiérrez, Yoan | Abreu Salas, José Ignacio | Montoyo, Andres | Muñoz, Rafael | Estévez-Velarde, Suilan
Grupo/s de investigación o GITE: Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
Centro, Departamento o Servicio: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Palabras clave: Heterogeneous data | Knowledge discovery | NERC | Natural language processing | Ontology and knowledge representation | Semantic data integration
Fecha de publicación: 19-ene-2024
Editor: Elsevier
Cita bibliográfica: Engineering Applications of Artificial Intelligence. 2024, 132: 107854. https://doi.org/10.1016/j.engappai.2024.107854
Resumen: This paper presents KD SENSO-MERGER, a novel Knowledge Discovery (KD) architecture that is capable of semantically integrating heterogeneous data from various sources of structured and unstructured data (i.e. geolocations, demographic, socio-economic, user reviews, and comments). This goal drives the main design approach of the architecture. It works by building internal representations that adapt and merge knowledge across multiple domains, ensuring that the knowledge base is continuously updated. To deal with the challenge of integrating heterogeneous data, this proposal puts forward the corresponding solutions: (i) knowledge extraction, addressed via a plugin-based architecture of knowledge sensors; (ii) data integrity, tackled by an architecture designed to deal with uncertain or noisy information; (iii) scalability, this is also supported by the plugin-based architecture as only relevant knowledge to the scenario is integrated by switching-off non-relevant sensors. Also, we minimize the expert knowledge required, which may pose a bottleneck when integrating a fast-paced stream of new sources. As proof of concept, we developed a case study that deploys the architecture to integrate population census and economic data, municipal cartography, and Google Reviews to analyze the socio-economic contexts of educational institutions. The knowledge discovered enables us to answer questions that are not possible through individual sources. Thus, companies or public entities can discover patterns of behavior or relationships that would otherwise not be visible and this would allow extracting valuable information for the decision-making process.
Patrocinador/es: This research is supported by the University of Alicante, Spain, the Spanish Ministry of Science and Innovation, the Generalitat Valenciana, Spain, and the European Regional Development Fund (ERDF) through the following funding: At the national level, the following projects were granted: TRIVIAL (PID2021-122263OB-C22); and CORTEX (PID2021-123956OB-I00), funded by MCIN/AEI/10.13039/501100011033 and, as appropriate, by ‘‘ERDF A way of making Europe’’, by the ‘‘European Union’’ or by the ‘‘European Union NextGenerationEU/PRTR’’. At regional level, the Generalitat Valenciana (Conselleria d’Educacio, Investigacio, Cultura i Esport), Spain, granted funding for NL4DISMIS (CIPROM/2021/21).
URI: http://hdl.handle.net/10045/140267
ISSN: 0952-1976 (Print) | 1873-6769 (Online)
DOI: 10.1016/j.engappai.2024.107854
Idioma: eng
Tipo: info:eu-repo/semantics/article
Derechos: © 2024 Elsevier Ltd.
Revisión científica: si
Versión del editor: https://doi.org/10.1016/j.engappai.2024.107854
Aparece en las colecciones:INV - GPLSI - Artículos de Revistas

Archivos en este ítem:
Archivos en este ítem:
Archivo Descripción TamañoFormato 
ThumbnailGutierrez_etal_2024_EngApplArtificIntellig_final.pdfVersión final (acceso restringido)3,59 MBAdobe PDFAbrir    Solicitar una copia
ThumbnailGutierrez_etal_2024_EngApplArtificIntellig_preprint.pdfPreprint (acceso abierto)10,94 MBAdobe PDFAbrir Vista previa


Todos los documentos en RUA están protegidos por derechos de autor. Algunos derechos reservados.