KD SENSO-MERGER: An architecture for semantic integration of heterogeneous data

Empreu sempre aquest identificador per citar o enllaçar aquest ítem http://hdl.handle.net/10045/140267
Información del item - Informació de l'item - Item information
Títol: KD SENSO-MERGER: An architecture for semantic integration of heterogeneous data
Autors: Gutiérrez, Yoan | Abreu Salas, José Ignacio | Montoyo, Andres | Muñoz, Rafael | Estévez-Velarde, Suilan
Grups d'investigació o GITE: Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
Centre, Departament o Servei: Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Paraules clau: Heterogeneous data | Knowledge discovery | NERC | Natural language processing | Ontology and knowledge representation | Semantic data integration
Data de publicació: 19-de gener-2024
Editor: Elsevier
Citació bibliogràfica: Engineering Applications of Artificial Intelligence. 2024, 132: 107854. https://doi.org/10.1016/j.engappai.2024.107854
Resum: This paper presents KD SENSO-MERGER, a novel Knowledge Discovery (KD) architecture that is capable of semantically integrating heterogeneous data from various sources of structured and unstructured data (i.e. geolocations, demographic, socio-economic, user reviews, and comments). This goal drives the main design approach of the architecture. It works by building internal representations that adapt and merge knowledge across multiple domains, ensuring that the knowledge base is continuously updated. To deal with the challenge of integrating heterogeneous data, this proposal puts forward the corresponding solutions: (i) knowledge extraction, addressed via a plugin-based architecture of knowledge sensors; (ii) data integrity, tackled by an architecture designed to deal with uncertain or noisy information; (iii) scalability, this is also supported by the plugin-based architecture as only relevant knowledge to the scenario is integrated by switching-off non-relevant sensors. Also, we minimize the expert knowledge required, which may pose a bottleneck when integrating a fast-paced stream of new sources. As proof of concept, we developed a case study that deploys the architecture to integrate population census and economic data, municipal cartography, and Google Reviews to analyze the socio-economic contexts of educational institutions. The knowledge discovered enables us to answer questions that are not possible through individual sources. Thus, companies or public entities can discover patterns of behavior or relationships that would otherwise not be visible and this would allow extracting valuable information for the decision-making process.
Patrocinadors: This research is supported by the University of Alicante, Spain, the Spanish Ministry of Science and Innovation, the Generalitat Valenciana, Spain, and the European Regional Development Fund (ERDF) through the following funding: At the national level, the following projects were granted: TRIVIAL (PID2021-122263OB-C22); and CORTEX (PID2021-123956OB-I00), funded by MCIN/AEI/10.13039/501100011033 and, as appropriate, by ‘‘ERDF A way of making Europe’’, by the ‘‘European Union’’ or by the ‘‘European Union NextGenerationEU/PRTR’’. At regional level, the Generalitat Valenciana (Conselleria d’Educacio, Investigacio, Cultura i Esport), Spain, granted funding for NL4DISMIS (CIPROM/2021/21).
URI: http://hdl.handle.net/10045/140267
ISSN: 0952-1976 (Print) | 1873-6769 (Online)
DOI: 10.1016/j.engappai.2024.107854
Idioma: eng
Tipus: info:eu-repo/semantics/article
Drets: © 2024 Elsevier Ltd.
Revisió científica: si
Versió de l'editor: https://doi.org/10.1016/j.engappai.2024.107854
Apareix a la col·lecció: INV - GPLSI - Artículos de Revistas

Arxius per aquest ítem:
Arxius per aquest ítem:
Arxiu Descripció Tamany Format  
ThumbnailGutierrez_etal_2024_EngApplArtificIntellig_final.pdfVersión final (acceso restringido)3,59 MBAdobe PDFObrir     Sol·licitar una còpia
ThumbnailGutierrez_etal_2024_EngApplArtificIntellig_preprint.pdfPreprint (acceso abierto)10,94 MBAdobe PDFObrir Vista prèvia


Tots els documents dipositats a RUA estan protegits per drets d'autors. Alguns drets reservats.