TY - GEN
T1 - Comparison of outlier detection methods in fault-proneness models
AU - Matsumoto, Shinsuke
AU - Kamei, Yasutaka
AU - Monden, Akito
AU - Matsumoto, Ken Ichi
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2007
Y1 - 2007
N2 - In this paper, we experimentally evaluated the effect of outlier detection methods to improve the prediction performance of fault-proneness models. Detected outliers were removed from a fit dataset before building a model. In the experiment, we compared three outlier detection methods (Mahalanobis outlier analysis (MOA), local outlier factor method (LOFM) and rule based modeling (RBM)) each applied to three well-known fault-proneness models (linear discriminant analysis (LDA), logistic regression analysis (LRA) and classification tree (CT)). As a result, MOA and RBM improved F1-values of all models (0.04 at minimum, 0.17 at maximum and 0.10 at mean) while improvements by LOFM were relatively small (-0.01 at minimum, 0.04 at maximum and 0.01 at mean).
AB - In this paper, we experimentally evaluated the effect of outlier detection methods to improve the prediction performance of fault-proneness models. Detected outliers were removed from a fit dataset before building a model. In the experiment, we compared three outlier detection methods (Mahalanobis outlier analysis (MOA), local outlier factor method (LOFM) and rule based modeling (RBM)) each applied to three well-known fault-proneness models (linear discriminant analysis (LDA), logistic regression analysis (LRA) and classification tree (CT)). As a result, MOA and RBM improved F1-values of all models (0.04 at minimum, 0.17 at maximum and 0.10 at mean) while improvements by LOFM were relatively small (-0.01 at minimum, 0.04 at maximum and 0.01 at mean).
UR - http://www.scopus.com/inward/record.url?scp=47949090643&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=47949090643&partnerID=8YFLogxK
U2 - 10.1109/ESEM.2007.34
DO - 10.1109/ESEM.2007.34
M3 - Conference contribution
AN - SCOPUS:47949090643
SN - 0769528864
SN - 9780769528861
T3 - Proceedings - 1st International Symposium on Empirical Software Engineering and Measurement, ESEM 2007
SP - 461
EP - 463
BT - Proceedings - 1st International Symposium on Empirical Software Engineering and Measurement, ESEM 2007
T2 - 1st International Symposium on Empirical Software Engineering and Measurement, ESEM 2007
Y2 - 20 September 2007 through 21 September 2007
ER -