TY - GEN
T1 - On Embedding Backdoor in Malware Detectors Using Machine Learning
AU - Sasaki, Shoichiro
AU - Hidano, Seira
AU - Uchibayashi, Toshihiro
AU - Suganuma, Takuo
AU - Hiji, Masahiro
AU - Kiyomoto, Shinsaku
PY - 2019/8
Y1 - 2019/8
N2 - Researching for malware detection using machine learning is becoming active. However, conventional detection techniques do not consider the impact of attacks on machine learning, which has become complicated in recent years. In this research, we focus on data poisoning attack, which is one of the typical attacks on machine learning, and aim to clarify the influence of attacks on malware detection technology. Data poisoning attack is an attack method that intentionally manipulates the predicted result of a learned model by injecting poisoning data into training data, and by applying this, it is possible to embed a backdoor that induces mis-prediction of only specific input data. In this paper, we first propose an attack framework for backdoor embedding that prevents detection of only specific types of malware by data poisoning attack. Next, we will describe a method to generate poisoning data efficiently while avoiding attack detection by solving the optimization problem. Furthermore, we take malware detection technology using logistic regression and show the effectiveness of the our method through evaluation experiments using two datasets.
AB - Researching for malware detection using machine learning is becoming active. However, conventional detection techniques do not consider the impact of attacks on machine learning, which has become complicated in recent years. In this research, we focus on data poisoning attack, which is one of the typical attacks on machine learning, and aim to clarify the influence of attacks on malware detection technology. Data poisoning attack is an attack method that intentionally manipulates the predicted result of a learned model by injecting poisoning data into training data, and by applying this, it is possible to embed a backdoor that induces mis-prediction of only specific input data. In this paper, we first propose an attack framework for backdoor embedding that prevents detection of only specific types of malware by data poisoning attack. Next, we will describe a method to generate poisoning data efficiently while avoiding attack detection by solving the optimization problem. Furthermore, we take malware detection technology using logistic regression and show the effectiveness of the our method through evaluation experiments using two datasets.
UR - http://www.scopus.com/inward/record.url?scp=85078813840&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078813840&partnerID=8YFLogxK
U2 - 10.1109/PST47121.2019.8949034
DO - 10.1109/PST47121.2019.8949034
M3 - Conference contribution
T3 - 2019 17th International Conference on Privacy, Security and Trust, PST 2019 - Proceedings
BT - 2019 17th International Conference on Privacy, Security and Trust, PST 2019 - Proceedings
A2 - Ghorbani, Ali
A2 - Ray, Indrakshi
A2 - Lashkari, Arash Habibi
A2 - Zhang, Jie
A2 - Lu, Rongxing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 17th International Conference on Privacy, Security and Trust, PST 2019
Y2 - 26 August 2019 through 28 August 2019
ER -