Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 6 von 7
Journal of software : evolution and process, 2022-12, Vol.34 (12), p.n/a
2022

Details

Autor(en) / Beteiligte
Titel
WIFLF: An approach independent of the target project for cross‐project defect prediction
Ist Teil von
  • Journal of software : evolution and process, 2022-12, Vol.34 (12), p.n/a
Ort / Verlag
Chichester: Wiley Subscription Services, Inc
Erscheinungsjahr
2022
Link zum Volltext
Quelle
Wiley Online Library
Beschreibungen/Notizen
  • Cross‐project defect prediction (CPDP) is used to build defect prediction models when data from the target project are not enough. There has been several approaches to improve the performance of CPDP, such as feature transformation and instance selection methods. However, existing techniques are strongly dependent on the target data to reduce the distribution discrepancy between source and target projects. That is, the performance of these methods is determined by the effectiveness of feature transformation or the similarity between two projects. Additionally, when there is a large amount of source data that needs to be matched with target data, it will take much time and reduce the efficiency of model construction. Therefore, it is vital to explore a target project‐agnostic approach to build CPDP models. This paper presents a Weighted Isolation Forest with class Label information Filter (WIFLF) to relieve the issues above. Four groups of datasets from AEEEM, Relink and PROMISE Data Repository are used to conduct CPDP models. Besides, WIFLF is compared with 12 approaches. The experimental results indicate that WIFLF significantly outperforms all the baselines. Specifically, WIFLF with random forest significantly improves the performance over the baselines on average by at least 14.64% and 4.90% with respect to Skewed F‐Measure and G‐Measure, respectively. A Weighted Isolation Forest with class Label information Filter (WIFLF) is proposed for instance selection, which is suitable for the scenario that training CPDP models in depending the target project. The algorithm with logistic regression (LR), random forest (RF), and naïve Bayes (NB) is used to 36 releases data sets of 22 projects from commonly available PROMISE Data Repository, AEEEM, and Relink. The comparative experiments are performed with 12 cross‐project defect prediction approaches, and the results indicate WIFLF is significantly outperforms all the baselines. Specifically, WIFLF with RF improves the performance over the baselines on average by at least 14.64% and 4.90% with respect to two overall measures: skewed F‐Measure and G‐Measure, respectively.
Sprache
Englisch
Identifikatoren
ISSN: 2047-7473
eISSN: 2047-7481
DOI: 10.1002/smr.2497
Titel-ID: cdi_proquest_journals_2743782938

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX