Espinosa, Roberto, Mazón, Jose-Norberto, Zubcoff, Jose Towards a reverse engineering approach for guiding user in applying data mining URI: http://hdl.handle.net/10045/25166 DOI: ISSN: Abstract: Data mining is at the core of the knowledge discovery process. However, an initial preprocessing step is crucial for assuring reliable results within this process. Preprocessing of data is a time-consuming and non-trivial task since data quality issues should be considered. This is even worst when dealing with complex data, not only because of the different kind of complex data types (XML, multimedia, and so on), but also because of the high dimensionality of complex data. Therefore, to overcome this situation, in this position paper we propose using mechanisms based on data reverse engineering for automatically measuring some data quality criteria on the data sources. These measures will guide user in selecting the most adequate data mining algorithm in the early stages of the knowledge discovery process. Finally, it is worth noting that this work is a first step towards considering, in a systematic and structured manner, data quality criteria for supporting data miners in applying those algorithms that obtain the most reliable knowledge from the available data sources. Keywords:Data mining, Data reverse engineering, Data quality, Knowledge discovery process info:eu-repo/semantics/conferenceObject