Data Mining Static Code Attributes To Learn Defect Predictors
All content following this page was uploaded by Tim Menzies on 13 June 2016.
Comments
TABLE 2
The Prediction Results
Table 2 also gives the expected pf values for each data set if Precision reaches 60 percent. We can see that, in order to achieve a Precision of 60 percent, the original pf values shown in Fig. 12 of [1] should be further improved (that is, lowered).

To verify our results, we have also repeated the study described in [1], using the same NASA data sets, the naive Bayes (with log-transforms) learner, and the WEKA tool. The results confirm that the prediction models proposed in [1] are impractical for software defect prediction because of their low precision.

REFERENCES
[1] T. Menzies, J. Greenwald, and A. Frank, “Data Mining Static Code Attributes to Learn Defect Predictors,” IEEE Trans. Software Eng., vol. 33, no. 1, Jan. 2007.
[2] R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval. ACM Press, 1999.
[3] I.H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, 1999.
3 CONCLUSION
In this comment, we have shown that the models built in [1] are not satisfactory for practical use. We suggest using Recall/Precision, instead of pd/pf, to measure the accuracy of a software defect prediction model.
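
The link between pd/pf and Precision discussed in this comment can be illustrated with a minimal sketch. It assumes the usual confusion-matrix definitions, pd = TP/(TP + FN) (the same quantity as Recall), pf = FP/(FP + TN), and Precision = TP/(TP + FP). The function names and the module counts in the usage lines are hypothetical; they are not taken from Table 2 or from [1].

```python
def precision_from_rates(pd, pf, pos, neg):
    """Precision implied by pd and pf on a data set with `pos` defective
    and `neg` non-defective modules (assumed standard definitions)."""
    tp = pd * pos          # defective modules that are flagged
    fp = pf * neg          # non-defective modules that are flagged
    if tp + fp == 0:
        raise ValueError("no module is predicted defective")
    return tp / (tp + fp)


def expected_pf(pd, pos, neg, target_precision):
    """pf needed to reach `target_precision` at a fixed pd.

    Solving Precision = pd*pos / (pd*pos + pf*neg) for pf gives
    pf = pd*pos*(1 - Precision) / (Precision*neg).
    """
    if not 0 < target_precision <= 1:
        raise ValueError("target_precision must be in (0, 1]")
    if neg <= 0:
        raise ValueError("neg must be positive")
    return pd * pos * (1 - target_precision) / (target_precision * neg)


if __name__ == "__main__":
    # Hypothetical data set: 100 defective and 900 non-defective modules.
    pos, neg = 100, 900
    pd, pf = 0.7, 0.2

    # 70 true positives against 180 false positives: Precision is about 0.28.
    print(precision_from_rates(pd, pf, pos, neg))

    # pf that would be required for Precision = 0.60 at the same pd.
    print(expected_pf(pd, pos, neg, 0.60))
```

Under these assumptions, the fewer defective modules a data set contains relative to non-defective ones, the lower pf must be to reach a given Precision, which is why a pd/pf pair that looks acceptable can still correspond to low Precision.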