Decision Support Systems, 2012-04, Vol.53 (1), p.226-233
2012

Details

Author(s) / Contributors
Title
Preprocessing unbalanced data using support vector machine
Is part of
  • Decision Support Systems, 2012-04, Vol.53 (1), p.226-233
Place / Publisher
Amsterdam: Elsevier B.V.
Year of publication
2012
Source
Elsevier ScienceDirect Journals
Descriptions/Notes
  • This paper applies the support vector machine (SVM) to the class imbalance problem. Its objective is to examine the feasibility and efficiency of SVM as a preprocessor. The study analyzes different classification algorithms used to predict which customers hold a caravan car policy, based on their sociodemographic data and history of product ownership. A series of experiments was conducted to test several computational intelligence techniques, namely Multilayer Perceptron (MLP), Logistic Regression (LR), and Random Forest (RF). Standard balancing techniques such as under-sampling, over-sampling, and the Synthetic Minority Over-sampling TEchnique (SMOTE) are also employed. A data balancing strategy for handling imbalanced class distributions is then proposed. The proposed approach first trains an SVM as a preprocessor and then replaces the actual target values of the training data with the predictions of the trained SVM. This modified training data is subsequently used to train techniques such as MLP, LR, and RF. Measured by sensitivity, the proposed approach not only balances the data effectively but also provides more instances of the minority class, which in turn enhances the performance of the intelligence techniques.
  • Support vector machine (SVM) acts as a preprocessor for unbalanced data.
  • SVM generates extra data related to the minority class.
  • The modified training data is used to train multiple classification techniques.
  • The hybrid approach performs well in terms of sensitivity.
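
The relabeling strategy described in the abstract could be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: it assumes scikit-learn, uses a synthetic data set from `make_classification` in place of the caravan policy data, and picks an RBF kernel with `class_weight="balanced"` as assumptions of this example. The helper name `svm_relabel` is hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC


def svm_relabel(X_train, y_train):
    """Fit an SVM and return its predictions on the training set as new targets."""
    # Assumption of this sketch: RBF kernel and balanced class weights.
    # The abstract does not state the SVM settings used in the paper.
    svm = SVC(kernel="rbf", class_weight="balanced")
    svm.fit(X_train, y_train)
    return svm.predict(X_train)


# Synthetic stand-in for an imbalanced data set (roughly 6% minority class).
X, y = make_classification(
    n_samples=2000, n_features=20, n_informative=5,
    weights=[0.94, 0.06], flip_y=0.05, random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Step 1: replace the actual training targets with the SVM's predictions.
y_modified = svm_relabel(X_train, y_train)

# Step 2: train a downstream classifier (here RF) on original and modified targets.
baseline = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)
hybrid = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_modified)

# Sensitivity is the recall on the minority (positive) class,
# always measured against the true, unmodified test labels.
sens_baseline = recall_score(y_test, baseline.predict(X_test), pos_label=1)
sens_hybrid = recall_score(y_test, hybrid.predict(X_test), pos_label=1)

print("minority count, original labels:", int(y_train.sum()))
print("minority count, SVM labels:", int(y_modified.sum()))
print(f"sensitivity baseline={sens_baseline:.3f}, hybrid={sens_hybrid:.3f}")
```

Only the training targets are replaced; the test labels stay untouched so that sensitivity reflects performance on real outcomes. Swapping `RandomForestClassifier` for `LogisticRegression` or `MLPClassifier` would cover the other two techniques named in the abstract.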
