UB Paderborn / Katalog / Suche / Details

Zur Ergebnisliste

Ergebnis 8 von 46

Using Multi-features and Ensemble Learning Method for Imbalanced Malware Classification

2016 IEEE Trustcom/BigDataSE/ISPA, 2016, p.965-973

2016

Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte

Titel

Using Multi-features and Ensemble Learning Method for Imbalanced Malware Classification

Ist Teil von

2016 IEEE Trustcom/BigDataSE/ISPA, 2016, p.965-973

Ort / Verlag

IEEE

Erscheinungsjahr

2016

Quelle

IEL

Beschreibungen/Notizen

The ever-growing malware threats in the cyber spacecalls for techniques that are more effective than widely deployedsignature-based detection system. To counter large volumes ofmalware variants, machine learning techniques have been appliedfor automated malware classification. Despite these efforts haveachieved a certain success, the accuracy and efficiency stillremained inadequate to meet demand, especially when thesemachine learning techniques are used in the situation of multipleclass classification and imbalanced training data. Against thisbackdrop, the goal of this paper is to build a malware classificationsystem that could be used to improve the above mentionedsituation. Our system is based on multiple categories of staticfeatures and ensemble learning method. Compared to sometraditional systems it has the following advantages. Firstly, withmultiple categories of features, our system could classify malwareto their corresponding family effectively and efficiently and at thesame time avoid the influence of evasion in certain extent. Ourmethod don't need any unpacking process and extract featuresfrom the bytes file and disassembled asm file directly. Secondly, the system employed two efficient ensemble learning models, namely XGBoost and ExtraTreeClassifer, and also combinedstacking method to construct the final classifier. Finally, weexperimented our system with the dataset provided by Microsofthosted in Kaggle for malware classification competition, andthe final results show that our method could classify malwareto their family effectively and efficiently with the accuracy of0.9972 in training set and logloss of 0.00395 in testing set. Ourwork not only offer insights into how to use multiple features forclassification, but also shed light on how to develop a scalabletechniques for automated malware classification in practice.

Sprache: Englisch
Identifikatoren: eISSN: 2324-9013
DOI: 10.1109/TrustCom.2016.0163
Titel-ID: cdi_ieee_primary_7847046

Format: –
Schlagworte: Classification algorithms, Feature extraction, Learning systems, Malware, Stacking, Training, Virtual machining

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX

Menü

Weitere Dienste

Einstellungen

Using Multi-features and Ensemble Learning Method for Imbalanced Malware Classification

Details

Weiterführende Literatur