Explanation of machine learning models using shapley additive explanation and application for real data in hospital

Yasunobu Nohara, Koutarou Matsumoto, Hidehisa Soejima, Naoki Nakashima

Research output: Contribution to journalArticlepeer-review

101 Citations (Scopus)

Abstract

Background and Objective: When using machine learning techniques in decision-making processes, the interpretability of the models is important. In the present paper, we adopted the Shapley additive explanation (SHAP), which is based on fair profit allocation among many stakeholders depending on their contribution, for interpreting a gradient-boosting decision tree model using hospital data. Methods: For better interpretability, we propose two novel techniques as follows: (1) a new metric of feature importance using SHAP and (2) a technique termed feature packing, which packs multiple similar features into one grouped feature to allow an easier understanding of the model without reconstruction of the model. We then compared the explanation results between the SHAP framework and existing methods using cerebral infarction data from our hospital. Results: The interpretation by SHAP was mostly consistent with that by the existing methods. We showed how the A/G ratio works as an important prognostic factor for cerebral infarction using proposed techniques. Conclusion: Our techniques are useful for interpreting machine learning models and can uncover the underlying relationships between features and outcome.

Original languageEnglish
Article number106584
JournalComputer Methods and Programs in Biomedicine
Volume214
DOIs
Publication statusPublished - Feb 2022

All Science Journal Classification (ASJC) codes

  • Software
  • Computer Science Applications
  • Health Informatics

Fingerprint

Dive into the research topics of 'Explanation of machine learning models using shapley additive explanation and application for real data in hospital'. Together they form a unique fingerprint.

Cite this