University of Hertfordshire

Building an Ensemble for Software Defect Prediction Based on Diversity Selection

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


Original language: English
Title of host publication: Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement
Place of Publication: New York, NY, USA
Publisher: ACM Press
ISBN (Print): 978-1-4503-4427-2
Publication status: Published - 9 Sep 2016
Event: IEEE International Symposium on Empirical Software Engineering and Measurement - Ciudad Real, Spain
Duration: 8 Sep 2016 to 9 Sep 2016

Publication series

Name: ESEM '16

Conference: IEEE International Symposium on Empirical Software Engineering and Measurement
City: Ciudad Real


Background: Ensemble techniques have gained attention in various scientific fields. Defect prediction researchers have investigated many state-of-the-art ensemble models and concluded that in many cases these outperform standard single-classifier techniques. Almost all previous work using ensemble techniques in defect prediction relies on the majority voting scheme for combining prediction outputs, and on the implicit diversity among single classifiers. Aim: Investigate whether defect prediction can be improved using an explicit diversity technique with a stacking ensemble, given that different classifiers identify different sets of defects. Method: We used classifiers from four different families and the weighted accuracy diversity (WAD) technique to exploit diversity amongst classifiers. To combine individual predictions, we used the stacking ensemble technique. We used state-of-the-art knowledge in software defect prediction to build our ensemble models, and tested their prediction abilities against 8 publicly available data sets. Conclusion: The results show performance improvement using stacking ensembles compared to other defect prediction models. Diversity amongst classifiers used for building ensembles is essential to achieving these performance improvements.
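
The motivation stated in the abstract, that different classifiers identify different sets of defects and that majority voting may not exploit this, can be illustrated with a small sketch. Everything below is an assumption made for illustration: the classifier names, the toy predictions, and the pairwise disagreement measure are hypothetical. The disagreement measure is a generic diversity measure, not the WAD technique, and the code does not reproduce the authors' stacking models or results.

```python
from itertools import combinations


def majority_vote(predictions):
    """Combine binary predictions (one list per classifier) by simple majority."""
    n_classifiers = len(predictions)
    return [1 if sum(votes) * 2 > n_classifiers else 0 for votes in zip(*predictions)]


def disagreement(a, b):
    """Fraction of modules on which two classifiers give different labels."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def mean_pairwise_disagreement(predictions):
    """Average disagreement over all classifier pairs (a generic diversity measure, not WAD)."""
    pairs = list(combinations(predictions, 2))
    return sum(disagreement(a, b) for a, b in pairs) / len(pairs)


# Hypothetical toy data for six modules: 1 = defective, 0 = clean.
actual = [1, 1, 1, 0, 0, 0]
preds = {
    "clf_a": [1, 0, 0, 0, 0, 0],  # finds only the first defect
    "clf_b": [0, 1, 0, 0, 0, 0],  # finds only the second defect
    "clf_c": [0, 0, 1, 0, 0, 1],  # finds the third defect, plus one false alarm
}

pred_lists = list(preds.values())
voted = majority_vote(pred_lists)
found_by_any = [int(any(votes)) for votes in zip(*pred_lists)]
diversity = mean_pairwise_disagreement(pred_lists)

print("majority vote:   ", voted)
print("found by any clf:", found_by_any)
print("mean disagreement:", round(diversity, 3))
```

In this toy setting every real defect is flagged by exactly one classifier, so the majority vote flags none of them. A stacking ensemble instead trains a second-level learner on the base classifiers' outputs, which at least has the opportunity to learn when a single dissenting classifier should be trusted.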

ID: 10588304