Automatic frequency-based feature selection using discrete weighted evolution strategy
dc.contributor.author | Nematzadeh, Hossein | |
dc.contributor.author | García-Nieto, José Manuel | |
dc.contributor.author | Navas-Delgado, Ismael | |
dc.contributor.author | Aldana-Montes, José Francisco | |
dc.date.accessioned | 2023-04-19T11:49:52Z | |
dc.date.available | 2023-04-19T11:49:52Z | |
dc.date.created | 2023-04-19 | |
dc.date.issued | 2022-10-10 | |
dc.identifier.citation | Nematzadeh, H., García-Nieto, J., Navas-Delgado, I., & Aldana-Montes, J. F. (2022). Automatic frequency-based feature selection using discrete weighted evolution strategy. Applied Soft Computing, 130, 109699. | es_ES |
dc.identifier.uri | https://hdl.handle.net/10630/26301 | |
dc.description.abstract | High-dimensional datasets usually suffer from the curse of dimensionality, which may increase classification time and decrease classification accuracy beyond a certain dimensionality. Feature selection is therefore used to discard redundant features and improve classification. Nonetheless, no single feature selection method can deal with all datasets. This paper therefore proposes an automatic hybrid feature selection method that incorporates both filter and wrapper methods, called Extended Mutual Congestion-Discrete Weighted Evolution Strategy (EMC-DWES). First, Extended Mutual Congestion (EMC) is proposed as a frequency-based filter ranker that discards irrelevant and redundant features using intrinsic statistics of the features. Second, Discrete Weighted Evolution Strategy (DWES) is applied to the features retained by EMC to perform the final automatic feature selection within a wrapper method. DWES clusters the features and applies mutation both to select the most relevant feature in each cluster at a time and to avoid selecting redundant features simultaneously, by assigning greater weights to the most informative clusters. The performance of EMC-DWES (in maximizing classification accuracy and minimizing the length of the selected subset) is investigated using benchmark high-dimensional medical datasets, including Covid-19. The superiority of EMC-DWES over state-of-the-art methods is also evaluated on all datasets. The implementation of EMC-DWES is available at https://github.com/KhaosResearch/EMC-DWES. | es_ES |
dc.description.sponsorship | This work has been partially funded by the Spanish Ministry of Science and Innovation via Grant PID2020-112540RB-C41 (AEI/FEDER, UE) and by the Andalusian PAIDI program via Grant P18-RT-2799. It is also supported by the LifeWatch-ERIC initiative ENVIRONMENTAL AND BIODIVERSITY CLIMATE CHANGE LAB (EnBiC2Lab). Funding for open access charge: Universidad de Málaga / CBUA. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Elsevier | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Data analysis | es_ES |
dc.subject.other | Curse of dimensionality | es_ES |
dc.subject.other | Automatic hybrid feature selection | es_ES |
dc.subject.other | Filter | es_ES |
dc.subject.other | Wrapper | es_ES |
dc.subject.other | High dimensional medical datasets | es_ES |
dc.subject.other | COVID-19 | es_ES |
dc.title | Automatic frequency-based feature selection using discrete weighted evolution strategy | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.centro | E.T.S.I. Telecomunicación | es_ES |
dc.identifier.doi | 10.1016/j.asoc.2022.109699 | |
dc.rights.cc | Attribution 4.0 International | * |
dc.type.hasVersion | info:eu-repo/semantics/publishedVersion | es_ES |
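
The abstract describes a two-stage hybrid pipeline: a filter ranker first prunes the features, then a wrapper based on an evolution strategy with weighted mutation picks the final subset. The sketch below illustrates only that general filter-then-wrapper shape in plain Python. It is not the authors' EMC or DWES: the class-mean-gap score, the 1-nearest-neighbour fitness, the size penalty, the (1+1) acceptance rule, the absence of feature clustering, and every function name and toy value are assumptions made for illustration. The authors' implementation is at the repository URL given in the abstract.

```python
import random

def filter_rank(X, y, keep):
    """Filter stage (assumed stand-in for EMC): score each feature by the
    absolute gap between its class means and keep the `keep` best ones."""
    scores = []
    for j in range(len(X[0])):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(abs(sum(pos) / len(pos) - sum(neg) / len(neg)))
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    kept = ranked[:keep]
    return kept, [scores[j] for j in kept]

def loo_accuracy(X, y, features):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier that
    only looks at the columns listed in `features`."""
    if not features:
        return 0.0
    correct = 0
    for i, row in enumerate(X):
        best_dist, best_label = None, None
        for k, other in enumerate(X):
            if k == i:
                continue
            dist = sum((row[j] - other[j]) ** 2 for j in features)
            if best_dist is None or dist < best_dist:
                best_dist, best_label = dist, y[k]
        correct += best_label == y[i]
    return correct / len(X)

def fitness(X, y, features, penalty=0.01):
    """Reward accuracy and penalise subset length (assumed trade-off)."""
    return loo_accuracy(X, y, features) - penalty * len(features)

def wrapper_search(X, y, candidates, weights, iterations=200, seed=0):
    """Wrapper stage (assumed stand-in for DWES): a (1+1) evolution strategy
    over a binary mask. Each mutation flips one candidate, drawn with
    probability proportional to its filter score."""
    rng = random.Random(seed)
    weights = [w + 1e-9 for w in weights]  # keep the total weight positive
    mask = [False] * len(candidates)       # start from the empty subset
    best = fitness(X, y, [])
    for _ in range(iterations):
        child = mask[:]
        idx = rng.choices(range(len(candidates)), weights=weights)[0]
        child[idx] = not child[idx]
        subset = [c for c, m in zip(candidates, child) if m]
        score = fitness(X, y, subset)
        if score > best:                   # keep strict improvements only
            mask, best = child, score
    return [c for c, m in zip(candidates, mask) if m], best

# Toy data (made up): only column 0 separates the two classes.
X = [[0.0, 1, 5, 2], [0.1, 4, 1, 3], [0.2, 2, 4, 1],
     [10.0, 1, 5, 3], [10.1, 4, 2, 2], [10.2, 2, 4, 1]]
y = [0, 0, 0, 1, 1, 1]

candidates, weights = filter_rank(X, y, keep=2)
selected, best = wrapper_search(X, y, candidates, weights)
print("kept by filter:", candidates)
print("selected by wrapper:", selected, "fitness:", round(best, 2))
```

Splitting the work this way keeps the expensive step, evaluating a classifier for every candidate subset, confined to the few features that survive the cheap ranking pass.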