Acta Universitatis Lodziensis. Folia oeconomica, 2018-01, Vol.6 (339), p.7-16
2018

Details

Author(s) / Contributors
Title
The Problem of Redundant Variables in Random Forests
Is part of
  • Acta Universitatis Lodziensis. Folia oeconomica, 2018-01, Vol.6 (339), p.7-16
Place / Publisher
Lodz: Wydawnictwo Uniwersytetu Łódzkiego
Year of publication
2018
Source
EZB Electronic Journals Library
Descriptions/Notes
  • Random forests are currently among the supervised learning methods most favored by practitioners. Their popularity stems from the possibility of applying the method without a time-consuming pre-processing step. Random forests can be used with mixed types of features, irrespective of their distributions. The method is robust to outliers, and feature selection is built into the learning algorithm. However, classification accuracy can decrease in the presence of redundant variables. In this paper, we discuss two approaches to the problem of redundant variables. We consider two strategies of searching for the best feature subset as well as two formulas for aggregating the features in clusters. In the empirical experiment, we generate collinear predictors and include them in real datasets. Dimensionality reduction methods usually improve the accuracy of random forests, but none of them clearly outperforms the others.
Language
English
Identifiers
ISSN: 0208-6018
eISSN: 2353-7663
DOI: 10.18778/0208-6018.339.01
Title ID: cdi_proquest_journals_2188845484
