

Pattern recognition letters, 2012-10, Vol.33 (13), p.1794-1804

2012


Author(s) / Contributors

Title

Efficient feature selection filters for high-dimensional data

Is part of

Place / Publisher

Elsevier B.V.

Year of publication

2012

Source

Alma/SFX Local Collection

Descriptions/Notes

► Two new log-linear time filter unsupervised feature selection methods.
► Relevance computed by dispersion; redundancy computed efficiently.
► Suited for binary and multi-class high-dimensional datasets.
► Very fast FS results on standard benchmark datasets.

Feature selection is a central problem in machine learning and pattern recognition. On large datasets (in terms of dimension and/or number of instances), using search-based or wrapper techniques can be computationally prohibitive. Moreover, many filter methods based on relevance/redundancy assessment also take a prohibitively long time on high-dimensional datasets. In this paper, we propose efficient unsupervised and supervised feature selection/ranking filters for high-dimensional datasets. These methods use low-complexity relevance and redundancy criteria, applicable to supervised, semi-supervised, and unsupervised learning, being able to act as pre-processors for computationally intensive methods to focus their attention on smaller subsets of promising features. The experimental results, with up to 10^5 features, show the time efficiency of our methods, with lower generalization error than state-of-the-art techniques, while being dramatically simpler and faster.
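
The description above names two ingredients: relevance measured by dispersion and an inexpensive redundancy check. The following is a minimal sketch of how such a filter might look, not the paper's actual algorithm. It assumes variance as the dispersion measure, absolute cosine similarity as the redundancy measure, and a comparison against only the most recently kept feature; the function names and the `max_similarity` and `max_features` parameters are hypothetical. Under these assumptions the cost is one sort plus one similarity computation per feature, which is log-linear in the number of features.

```python
import math


def variance(col):
    """Population variance of one feature column (assumed dispersion measure)."""
    mean = sum(col) / len(col)
    return sum((x - mean) ** 2 for x in col) / len(col)


def abs_cosine(a, b):
    """Absolute cosine similarity of two columns (assumed redundancy measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return abs(dot) / norm if norm > 0 else 0.0


def select_features(rows, max_similarity=0.9, max_features=None):
    """Return indices of kept features, most dispersed first.

    rows: list of instances, each a list of numeric feature values.
    A feature is kept only if its similarity to the last kept feature
    is below max_similarity (hypothetical threshold).
    """
    columns = list(zip(*rows))
    # Relevance step: rank features by decreasing dispersion.
    ranked = sorted(range(len(columns)),
                    key=lambda j: variance(columns[j]), reverse=True)
    kept = []
    for j in ranked:
        if max_features is not None and len(kept) >= max_features:
            break
        # Redundancy step: compare only with the most recently kept feature.
        if not kept or abs_cosine(columns[j], columns[kept[-1]]) < max_similarity:
            kept.append(j)
    return kept


# Feature 1 is feature 0 scaled by 2, so one of the pair is redundant.
rows = [[1, 2, 0, 1],
        [2, 4, 0, 0],
        [3, 6, 0, 1],
        [4, 8, 1, 0]]
selected = select_features(rows)
```

Comparing each candidate only with the last kept feature keeps the redundancy step linear, at the price of missing redundancy between features that are far apart in the ranking.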

Language: English
Identifiers: ISSN: 0167-8655
eISSN: 1872-7344
DOI: 10.1016/j.patrec.2012.05.019
Title ID: cdi_crossref_primary_10_1016_j_patrec_2012_05_019

Format: –
Keywords: Dispersion measures, Feature selection, Filters, High-dimensional data, Similarity measures
