학술논문

Validity Analysis of Software Defect Prediction Model for Mis-label Correction Based on CleanLab
Document Type
Conference
Source
2023 10th International Conference on Dependable Systems and Their Applications (DSA) DSA Dependable Systems and Their Applications (DSA), 2023 10th International Conference on. :233-241 Aug, 2023
Subject
Computing and Processing
Training
Learning systems
Analytical models
Training data
Predictive models
Software
Data models
Software Defect Prediction
Mis-Label
Confide nce learning
Language
ISSN
2767-6684
Abstract
In code-based software defect prediction models, the problem of mislabeling often occurs, resulting in reduced model performance and inaccurate decisions. To tackle this issue, this study presents Cleanlab, a confidence learning-based method, for detecting and rectifying mis-labels. Unlike conventional approaches, our emphasis is on code-based defect prediction models. We validate the efficacy of Cleanlab by conducting case studies on 32 versions of nine projects, using three well-known models: CNN, DBN, and LINE-DP. The experimental findings demonstrate substantial enhancements in the robustness and accuracy of our code-based defect prediction model when incorporating Cleanlab.