Leveraging Large Language Models for Sensor Data Retrieval

Berenguer, Alberto; Morejón, Adriana; Tomás, David; Mazón, Jose-Norberto

Leveraging Large Language Models for Sensor Data Retrieval

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/141551

Registro completo de metadatos

Registro completo de metadatos
Campo DC	Valor	Idioma
dc.contributor	Procesamiento del Lenguaje y Sistemas de Información (GPLSI)	es_ES
dc.contributor	Web and Knowledge (WaKe)	es_ES
dc.contributor.author	Berenguer, Alberto	-
dc.contributor.author	Morejón, Adriana	-
dc.contributor.author	Tomás, David	-
dc.contributor.author	Mazón, Jose-Norberto	-
dc.contributor.other	Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos	es_ES
dc.date.accessioned	2024-03-20T09:34:36Z	-
dc.date.available	2024-03-20T09:34:36Z	-
dc.date.issued	2024-03-15	-
dc.identifier.citation	Applied Sciences. 2024, 14(6): 2506. https://doi.org/10.3390/app14062506	es_ES
dc.identifier.issn	2076-3417	-
dc.identifier.uri	http://hdl.handle.net/10045/141551	-
dc.description.abstract	The growing significance of sensor data in the development of information technology services finds obstacles due to disparate data presentations and non-adherence to FAIR principles. This paper introduces a novel approach for sensor data gathering and retrieval. The proposal leverages large language models to convert sensor data into FAIR-compliant formats and to provide word embedding representations of tabular data for subsequent exploration, enabling semantic comparison. The proposed system comprises two primary components. The first focuses on gathering data from sensors and converting it into a reusable structured format, while the second component aims to identify the most relevant sensor data to augment a given user-provided dataset. The evaluation of the proposed approach involved comparing the performance of various large language models in generating representative word embeddings for each table to retrieve related sensor data. The results show promising performance in terms of precision and MRR (0.90 and 0.94 for the best-performing model, respectively), indicating the system’s ability to retrieve pertinent sensor data that fulfil user requirements.	es_ES
dc.description.sponsorship	This research was partially funded by MCIN/AEI/10.13039/501100011033 and by the European Union Next Generation EU/PRTR as part of the projects TED2021130890B-C21 and PID2021-122263OB-C22, as well a by REMARKABLE project (HORIZON-MSCA-2021-SE-0 action number: 101086387). The APC was funded by CIAICO/2022/019 project from Generalitat Valenciana. Alberto Berenguer has a contract for predoctoral training with “Generalitat Valenciana” and the European Social Fund, funded by grant number ACIF/2021/507.	es_ES
dc.language	eng	es_ES
dc.publisher	MDPI	es_ES
dc.rights	© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).	es_ES
dc.subject	Sensor data	es_ES
dc.subject	Large language models	es_ES
dc.subject	Word embeddings	es_ES
dc.subject	Data retrieval	es_ES
dc.subject	FAIR principles	es_ES
dc.title	Leveraging Large Language Models for Sensor Data Retrieval	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.peerreviewed	si	es_ES
dc.identifier.doi	10.3390/app14062506	-
dc.relation.publisherversion	https://doi.org/10.3390/app14062506	es_ES
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/TED2021-130890B-C21	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2021-122263OB-C22	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/H2020/101086387	es_ES
Aparece en las colecciones:	INV - GPLSI - Artículos de Revistas INV - WaKe - Artículos de Revistas Investigaciones financiadas por la UE

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
Berenguer_etal_2024_ApplSci.pdf		1,1 MB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro sencillo