Feature selection based on information theory

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/18362
Full item record
Dublin Core field: Value (Language)
dc.contributor.advisor: Cazorla Quevedo, Miguel Ángel
dc.contributor.advisor: Escolano Ruiz, Francisco
dc.contributor.author: Bonev, Boyan
dc.contributor.other: Universidad de Alicante. Departamento de Ciencia de la Computación e Inteligencia Artificial (en)
dc.date.accessioned: 2011-07-27T12:02:52Z
dc.date.available: 2011-07-27T12:02:52Z
dc.date.created: 2010
dc.date.issued: 2010
dc.date.submitted: 2010-06-29
dc.identifier.isbn: 978-84-694-1634-1
dc.identifier.uri: http://hdl.handle.net/10045/18362
dc.description.abstract: Along with the improvement of data acquisition techniques and the increasing computational capacity of computers, the dimensionality of data grows ever higher. Pattern recognition methods must deal with samples consisting of thousands of features, and reducing their dimensionality becomes crucial to keep them tractable. Feature selection is a technique for removing irrelevant and noisy features and selecting a subset of features which better describe the samples and yield better classification performance. It is becoming an essential part of most pattern recognition applications. (en)
dc.description.abstract: In this thesis we propose a feature selection method for supervised classification. The main contribution is the efficient use of information theory, which provides a solid theoretical framework for measuring the relation between the classes and the features. Mutual information is considered the best measure for this purpose. Traditionally it has been used to rank single features, without taking into account the entire set of selected features, because of the computational complexity involved in estimating it. However, in most data sets the features are not independent, and their combination provides much more information about the class than the sum of their individual predictive powers. (en)
dc.description.abstract: Methods based on density estimation can only be used for data sets with a very large number of samples and a small number of features. Due to the curse of dimensionality, in a multi-dimensional feature space the number of samples required for reliable density estimation is very high. For this reason we analyse different estimation methods which bypass the density estimation and estimate entropy directly from the set of samples. These methods allow us to efficiently evaluate sets of thousands of features. (en)
dc.description.abstract: For high-dimensional feature sets another problem is the order in which the feature space is searched. All algorithms with a non-prohibitive computational cost search for a sub-optimal feature set. Greedy algorithms are the fastest and the ones which incur the least overfitting. We show that, from the information-theoretic perspective, a greedy backward selection algorithm conserves the amount of mutual information, even though the resulting feature set is not the minimal one. (en)
dc.description.abstract: We also validate our method in several real-world applications. We apply feature selection to omnidirectional image classification through a novel approach: it is appearance-based, and we select features from a bank of filters applied to different parts of the image. The context of the task is place recognition for mobile robotics. Another set of experiments is performed on microarrays from gene expression databases, where the classification problem is to predict the disease of a new patient. We present a comparison of classification performance in which the algorithms we propose outperform the existing ones. Finally, we successfully apply feature selection to spectral graph classification. All the features we use are for unattributed graphs, which constitutes a contribution to the field. We also draw interesting conclusions about which spectral features matter most under different experimental conditions. In the context of graph classification we also show how important the precise estimation of mutual information is, and we analyse its impact on the final classification results. (en)
dc.language: eng (en)
dc.publisher: Universidad de Alicante (en)
dc.subject: Feature selection (en)
dc.subject: Information theory (en)
dc.subject: Pattern recognition (en)
dc.subject: Supervised classification (en)
dc.subject: Selección de características (en)
dc.subject: Teoría de la información (en)
dc.subject: Reconocimiento de patrones (en)
dc.subject: Clasificación supervisada (en)
dc.subject.other: Ciencia de la Computación e Inteligencia Artificial (en)
dc.title: Feature selection based on information theory (en)
dc.type: info:eu-repo/semantics/doctoralThesis (en)
dc.rights.accessRights: info:eu-repo/semantics/openAccess
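The selection strategy the abstract outlines, greedily growing a feature set by the mutual information between the jointly-taken selected features and the class, can be sketched as follows. This is a minimal illustration, not the thesis's method: the function names and toy data are invented, and a simple plug-in entropy estimator over discrete features stands in for the bypass entropy estimators the thesis actually analyses.

```python
# Hedged sketch of greedy forward feature selection driven by mutual
# information. All names and the toy data below are illustrative.
from collections import Counter
from math import log2

def entropy(values):
    """Plug-in (maximum-likelihood) entropy of a discrete sequence, in bits."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())

def mutual_information(feature_tuples, labels):
    """I(F; C) = H(F) + H(C) - H(F, C), with F a (possibly joint) feature."""
    joint = list(zip(feature_tuples, labels))
    return entropy(feature_tuples) + entropy(labels) - entropy(joint)

def greedy_forward_selection(X, y, k):
    """At each step add the feature whose inclusion maximises the mutual
    information between the selected set, taken jointly, and the class."""
    selected = []
    remaining = list(range(len(X[0])))
    while remaining and len(selected) < k:
        def score(j):
            cols = selected + [j]
            return mutual_information(
                [tuple(row[c] for c in cols) for row in X], y)
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy data: feature 0 equals the class, feature 1 is noise, feature 2 is constant.
X = [(0, 1, 0), (0, 0, 0), (1, 1, 0), (1, 0, 0)]
y = [0, 0, 1, 1]
print(greedy_forward_selection(X, y, 2))  # → [0, 1]
```

Evaluating the selected features jointly, rather than ranking them one by one, is what lets such a criterion capture feature interactions; the cost is that the joint estimate degrades as the selected set grows, which is why the thesis turns to direct entropy estimators.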
Appears in collection: Tesis doctorals

Files in this item:
File: tesis_Ivanov.pdf (3.63 MB, Adobe PDF)


This item is licensed under a Creative Commons License.