Information sciences, 2020-03, Vol.513, p.429-441
2020

Details

Author(s) / Contributors
Title
Data imbalance in classification: Experimental evaluation
Is part of
  • Information sciences, 2020-03, Vol.513, p.429-441
Place / Publisher
Elsevier Inc
Year of publication
2020
Source
Alma/SFX Local Collection
Descriptions/Notes
  • The advent of Big Data has ushered in a new era of scientific breakthroughs. One common issue affecting raw data is the class imbalance problem, which refers to an imbalanced distribution of the values of the response variable. This issue is present in fraud detection, network intrusion detection, medical diagnostics, and a number of other fields where negatively labeled instances significantly outnumber positively labeled instances. Modern machine learning techniques struggle with imbalanced data because they focus on minimizing the error rate for the majority class while ignoring the minority class. The goal of our paper is to demonstrate the effects of class imbalance on classification models. Concretely, we study the impact of varying class imbalance ratios on classifier accuracy. By highlighting the precise nature of the relationship between the degree of class imbalance and the corresponding effects on classifier performance, we hope to help researchers better tackle the problem. To this end, we carry out extensive experiments using 10-fold cross-validation on a large number of datasets. In particular, we determine that the relationship between the class imbalance ratio and the accuracy is convex.
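The kind of experiment the abstract describes (varying the class imbalance ratio and measuring accuracy with 10-fold cross-validation) can be sketched as follows. This is only an illustrative sketch, not the authors' setup: the synthetic one-dimensional Gaussian data, the Gaussian naive Bayes classifier, and all names (`make_data`, `stratified_folds`, `cross_validate`, the `shift` parameter) are assumptions introduced here. It reports minority-class recall next to accuracy, since accuracy alone can hide poor performance on the minority class.

```python
import math
import random
from statistics import mean, pvariance


def make_data(n, pos_fraction, rng, shift=1.5):
    """Hypothetical synthetic data: negatives ~ N(0, 1), positives ~ N(shift, 1)."""
    n_pos = round(n * pos_fraction)
    data = [(rng.gauss(shift, 1.0), 1) for _ in range(n_pos)]
    data += [(rng.gauss(0.0, 1.0), 0) for _ in range(n - n_pos)]
    return data


def stratified_folds(data, k, rng):
    """Split data into k folds that each keep roughly the overall class ratio."""
    folds = [[] for _ in range(k)]
    for label in (0, 1):
        rows = [row for row in data if row[1] == label]
        rng.shuffle(rows)
        for i, row in enumerate(rows):
            folds[i % k].append(row)
    return folds


def fit(train):
    """Gaussian naive Bayes on one feature: per-class prior, mean, variance."""
    model = {}
    for label in (0, 1):
        xs = [x for x, y in train if y == label]
        model[label] = (len(xs) / len(train), mean(xs), pvariance(xs) + 1e-9)
    return model


def predict(model, x):
    """Return the class with the larger (unnormalized) log posterior."""
    def log_posterior(label):
        prior, mu, var = model[label]
        return math.log(prior) - 0.5 * math.log(var) - (x - mu) ** 2 / (2 * var)
    return 1 if log_posterior(1) > log_posterior(0) else 0


def cross_validate(data, k=10, seed=0):
    """Return (mean accuracy, mean minority-class recall) over k folds."""
    folds = stratified_folds(data, k, random.Random(seed))
    accuracies, recalls = [], []
    for i in range(k):
        test = folds[i]
        train = [row for j, fold in enumerate(folds) if j != i for row in fold]
        model = fit(train)
        results = [(predict(model, x), y) for x, y in test]
        accuracies.append(sum(p == y for p, y in results) / len(results))
        positives = [p for p, y in results if y == 1]
        recalls.append(sum(positives) / len(positives))
    return sum(accuracies) / k, sum(recalls) / k


if __name__ == "__main__":
    rng = random.Random(42)
    # Sweep the share of positive (minority) instances from balanced to rare.
    for fraction in (0.5, 0.3, 0.1, 0.05, 0.01):
        acc, rec = cross_validate(make_data(2000, fraction, rng))
        print(f"positive fraction {fraction:.2f}: "
              f"accuracy {acc:.3f}, minority recall {rec:.3f}")
```

Stratified folds are used here so that every fold contains at least a few minority instances even at the most skewed setting; whether the paper stratifies its folds is not stated in the abstract.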
Language
English
Identifiers
ISSN: 0020-0255
eISSN: 1872-6291
DOI: 10.1016/j.ins.2019.11.004
Title ID: cdi_crossref_primary_10_1016_j_ins_2019_11_004
