A Data Analytics Methodology to Visually Analyze the impact of Bias and Rebalancing

Empreu sempre aquest identificador per citar o enllaçar aquest ítem http://hdl.handle.net/10045/134700
Registre complet
Registre complet
Camp Dublin Core Valor Idioma
dc.contributorLucentiaes_ES
dc.contributor.authorLavalle, Ana-
dc.contributor.authorMaté, Alejandro-
dc.contributor.authorTrujillo, Juan-
dc.contributor.authorTeruel, Miguel A.-
dc.contributor.otherUniversidad de Alicante. Departamento de Lenguajes y Sistemas Informáticoses_ES
dc.date.accessioned2023-05-29T06:22:36Z-
dc.date.available2023-05-29T06:22:36Z-
dc.date.issued2023-05-24-
dc.identifier.citationIEEE Access. 2023, 11: 56691-56702. https://doi.org/10.1109/ACCESS.2023.3279732es_ES
dc.identifier.issn2169-3536-
dc.identifier.urihttp://hdl.handle.net/10045/134700-
dc.description.abstractData Analytics have become a key component of many business processes which influence several aspects of our daily life. Indeed, any misinterpretation or flaw in the outputs of Data Analytics results can cause significant damage, specialy when dealing with one of the most often overlooked issues, namely the unaware use of biased data. When data bias goes unadverted, it warps the meaning of data, having a devastating effect on Data Analytics results. Although it is widely argued that the most common manner to deal with data bias is to rebalance biased datasets, it is not an aseptic transformation, leading to several potentially undesired side-effects that will probably harm the result of data analyses. Therefore, in order to analyze the underlying bias in datasets, in this work we present (i) a comprehensive methodology based on visualization techniques, which assists users in the definition of their analytical requirements to detect and visually represent the data bias automatically helping them to find out whether it is appropriate to artificially rebalance their dataset or not; (ii) a novel metamodel for visually representing bias; (iii) a motivating real-world running example used to analyze the impact of bias in Data Analytics and (iv) an assessment of the improvements introduced by our proposal through a complete real-world case study by using a Fire Department Calls for Service dataset, thus demonstrating that rebalancing datasets is not always the best option. It is crucial to study the context where the decisions are going to be taken. Moreover, it is also important to do a pre-analysis with the aim of knowing the nature of the datasets and how biased they are.es_ES
dc.description.sponsorshipThis work has been co-funded by the AETHER-UA project (PID2020-112540RB-C43) funded by Spanish Ministry of Science and Innovation and the BALLADEER (PROMETEO /2021/088) project funded by the Conselleria de Innovación, Universidades, Ciencia y Sociedad Digital (Generalitat Valenciana).es_ES
dc.languageenges_ES
dc.publisherIEEEes_ES
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/es_ES
dc.subjectData Analyticses_ES
dc.subjectData Biases_ES
dc.subjectData Visualizationes_ES
dc.subjectModel-driven developmentes_ES
dc.subjectRequirements Engineeringes_ES
dc.subjectArtificial Intelligencees_ES
dc.titleA Data Analytics Methodology to Visually Analyze the impact of Bias and Rebalancinges_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.peerreviewedsies_ES
dc.identifier.doi10.1109/ACCESS.2023.3279732-
dc.relation.publisherversionhttps://doi.org/10.1109/ACCESS.2023.3279732es_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-112540RB-C43es_ES
Apareix a la col·lecció: INV - LUCENTIA - Artículos de Revistas

Arxius per aquest ítem:
Arxius per aquest ítem:
Arxiu Descripció Tamany Format  
ThumbnailLavalle_etal_2023_IEEEAccess.pdf1,89 MBAdobe PDFObrir Vista prèvia


Aquest ítem està subjecte a una llicència de Creative Commons Llicència Creative Commons Creative Commons