UB Paderborn / Katalog / Suche / Details

Zur Ergebnisliste

Ergebnis 24 von 8845

Framework for the Ensemble of Feature Selection Methods

Applied sciences, 2021-09, Vol.11 (17), p.8122

2021

Details

Autor(en) / Beteiligte

Titel

Framework for the Ensemble of Feature Selection Methods

Ist Teil von

Applied sciences, 2021-09, Vol.11 (17), p.8122

Ort / Verlag

Basel: MDPI AG

Erscheinungsjahr

2021

Link zum Volltext

Quelle

Free E-Journal (出版社公開部分のみ）

Beschreibungen/Notizen

Feature selection (FS) has attracted the attention of many researchers in the last few years due to the increasing sizes of datasets, which contain hundreds or thousands of columns (features). Typically, not all columns represent relevant values. Consequently, the noise or irrelevant columns could confuse the algorithms, leading to a weak performance of machine learning models. Different FS algorithms have been proposed to analyze highly dimensional datasets and determine their subsets of relevant features to overcome this problem. However, very often, FS algorithms are biased by the data. Thus, methods for ensemble feature selection (EFS) algorithms have become an alternative to integrate the advantages of single FS algorithms and compensate for their disadvantages. The objective of this research is to propose a conceptual and implementation framework to understand the main concepts and relationships in the process of aggregating FS algorithms and to demonstrate how to address FS on datasets with high dimensionality. The proposed conceptual framework is validated by deriving an implementation framework, which incorporates a set of Phyton packages with functionalities to support the assembly of feature selection algorithms. The performance of the implementation framework was demonstrated in several experiments discovering relevant features in the Sonar, SPECTF, and WDBC datasets. The experiments contrasted the accuracy of two machine learning classifiers (decision tree and logistic regression), trained with subsets of features generated either by single FS algorithms or the set of features selected by the ensemble feature selection framework. We observed that for the three datasets used (Sonar, SPECTF, and WD), the highest precision percentages (86.95%, 74.73%, and 93.85%, respectively) were obtained when the classifiers were trained with the subset of features generated by our framework. Additionally, the stability of the feature sets generated using our ensemble method was evaluated. The results showed that the method achieved perfect stability for the three datasets used in the evaluation.

Sprache: Englisch
Identifikatoren: ISSN: 2076-3417
eISSN: 2076-3417
DOI: 10.3390/app11178122
Titel-ID: cdi_doaj_primary_oai_doaj_org_article_8b716e6ce0364d54b3daae6a6d6467c8

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX

Menü

Weitere Dienste

Einstellungen

Framework for the Ensemble of Feature Selection Methods

Details

Weiterführende Literatur