TY - GEN
T1 - A hybrid faulty module prediction using association rule mining and logistic regression analysis
AU - Kamei, Yasutaka
AU - Monden, Akito
AU - Morisaki, Shuji
AU - Matsumoto, Ken Ichi
PY - 2008
Y1 - 2008
N2 - This paper proposes a fault-prone module prediction method that combines association rule mining with logistic regression analysis. In the proposed method, we focus on three key measures of interestingness of an association rule (support, confidence and lift) to select useful rules for the prediction. If a module satisfies the premise (i.e. the condition in the antecedent part) of one of the selected rules, the module is classified by the rule as either fault-prone or not. Otherwise, the module is classified by the logistic model. We experimentally evaluated the prediction performance of the proposed method with different thresholds of each rule interestingness measure (support, confidence and lift) using a module set in the Eclipse project, and compared it with three well-known fault-proneness models (logistic regression model, linear discriminant model and classification tree). The result showed that the improvement of the Fl-value of the proposed method was 0.163 at maximum compared to conventional models.
AB - This paper proposes a fault-prone module prediction method that combines association rule mining with logistic regression analysis. In the proposed method, we focus on three key measures of interestingness of an association rule (support, confidence and lift) to select useful rules for the prediction. If a module satisfies the premise (i.e. the condition in the antecedent part) of one of the selected rules, the module is classified by the rule as either fault-prone or not. Otherwise, the module is classified by the logistic model. We experimentally evaluated the prediction performance of the proposed method with different thresholds of each rule interestingness measure (support, confidence and lift) using a module set in the Eclipse project, and compared it with three well-known fault-proneness models (logistic regression model, linear discriminant model and classification tree). The result showed that the improvement of the Fl-value of the proposed method was 0.163 at maximum compared to conventional models.
UR - http://www.scopus.com/inward/record.url?scp=62949103673&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=62949103673&partnerID=8YFLogxK
U2 - 10.1145/1414004.1414051
DO - 10.1145/1414004.1414051
M3 - Conference contribution
AN - SCOPUS:62949103673
SN - 9781595939715
T3 - ESEM'08: Proceedings of the 2008 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement
SP - 279
EP - 281
BT - ESEM'08
T2 - 2nd International Symposium on Empirical Software Engineering and Measurement, ESEM 2008
Y2 - 9 October 2008 through 10 October 2008
ER -