Professional Documents
Culture Documents
Brief PHD
Brief PHD
Software test automation has undeniably improved the entire software test process bringing
speed, accuracy and wider test coverage, consistency, cost saving and provides for frequent
and thorough testing. However, there has been a consistent problem of class imbalance,
presence of null values and high dimensionality of defects in existing data sets which can
impact on the classification of defects and result in wrong prediction of the model. Most
existing defect detection models focus more on the model’s accuracy and performance rather
than the actual defect return rate and this in turn neglects the initial purpose of the model in
Defect return rate among existing models is not optimal. This study attempts to develop a
more efficient classifier and apply a suitable boosting technique, class balancing and
dropping of Null values to boost the resultant algorithm to obtain a better prediction model
This research aims to address the challenges of software defect prediction by designing and
implementing a modified classification model that can overcome issues of class imbalance,
research:
1. Identify the challenges of class imbalance, null values, overlapping and high
2. Implement suitable class balancing, null value dropping and high dimensionality
classifier.
5. Compare the performance of the modified models with existing models to determine
average of over 80% across all the metrics and 86% overall performance which makes it a
good classifier. Only Random Forest bettered our classifier on this data set.