Code cleaning for software defect prediction:A cautionary tale

Shippey, T. and Bowes, D. and Counsell, S. and Hall, T. (2018) Code cleaning for software defect prediction:A cautionary tale. In: 2018 44th Euromicro Conference on Software Engineering and Advanced Applications (SEAA). IEEE, pp. 239-243. ISBN 9781538673836

Full text not available from this repository.


In this paper, we describe our experience of developing a new technique to improve defect prediction (code cleaning) which performed very encouragingly on the first two systems on which we evaluated it (both systems had their origins in one company). Code cleaning also worked well on an additional open source system (Eclipse). But our code cleaning technique then performed disappointingly on all 69 subsequent open source systems on which we evaluated it. Without our round two evaluations on these 69 open source systems we would have published misleading prediction results. We discuss the need for performance evaluations to be performed on carefully selected samples of systems if reliable conclusions are to be drawn.

Item Type:
Contribution in Book/Report/Proceedings
ID Code:
Deposited By:
Deposited On:
18 Mar 2019 11:35
Last Modified:
17 Sep 2023 04:03