TY - GEN
T1 - An Empirical Study of Source Code Detection Using Image Classification
AU - Hong, Juntong
AU - Mizuno, Osamu
AU - Kondo, Masanari
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/12
Y1 - 2019/12
N2 - The detection of programming language for a source code file has achieved high accuracy using the machine learning techniques. On the other hand, for a piece of software (called snippet), the detection of programming language is required to append tags automatically in a question and answer site such as Stack Overflow. However, the detection of programming language for a snippet is still a challenge since snippets is not a complete source code. Usually, experienced developers can detect the language of such snippet at a glance. It is considered that such a task that a human being easily solves can be solved by the image classification method using deep learning technique. Therefore, we propose a programming language detection method using a deep learning based image classification method. By using the data from actual Q&A site, we evaluate our proposed model. The results of experiment demonstrate that we can successfully detect the correct programming language for snippets with over 90% accuracy.
AB - The detection of programming language for a source code file has achieved high accuracy using the machine learning techniques. On the other hand, for a piece of software (called snippet), the detection of programming language is required to append tags automatically in a question and answer site such as Stack Overflow. However, the detection of programming language for a snippet is still a challenge since snippets is not a complete source code. Usually, experienced developers can detect the language of such snippet at a glance. It is considered that such a task that a human being easily solves can be solved by the image classification method using deep learning technique. Therefore, we propose a programming language detection method using a deep learning based image classification method. By using the data from actual Q&A site, we evaluate our proposed model. The results of experiment demonstrate that we can successfully detect the correct programming language for snippets with over 90% accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85078156012&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078156012&partnerID=8YFLogxK
U2 - 10.1109/IWESEP49350.2019.00009
DO - 10.1109/IWESEP49350.2019.00009
M3 - Conference contribution
AN - SCOPUS:85078156012
T3 - Proceedings - 2019 10th International Workshop on Empirical Software Engineering in Practice, IWESEP 2019
SP - 1
EP - 6
BT - Proceedings - 2019 10th International Workshop on Empirical Software Engineering in Practice, IWESEP 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th International Workshop on Empirical Software Engineering in Practice, IWESEP 2019
Y2 - 13 December 2019 through 14 December 2019
ER -