TY - JOUR
T1 - A SEGMENTATION METHOD FOR KUZUSHIJI BASED ON K-MEANS CLUSTERING
AU - Cui, Wenyi
AU - Inoue, Kohei
N1 - Publisher Copyright:
ICIC International © 2024.
PY - 2024/2
Y1 - 2024/2
N2 - Ancient Japanese books record a great amount of information, which are valuable research materials in study of history and culture. Over the past few years, there was a large-scale research on digitization of ancient Japanese books and we are convenient to use them due to open access on the Internet. However, it is a challenging problem to recognize those ancient Japanese books due to the complex background and unsteady shape of Japanese characters, which is called Kuzushiji. In this paper, we proposed a method to segment Japanese characters by using image processing and clustering. Our method is based on the analysis of character characteristics, which could identify the segmentation points more accurately. The validity of the proposed method was confirmed by the evaluation experiment.
AB - Ancient Japanese books record a great amount of information, which are valuable research materials in study of history and culture. Over the past few years, there was a large-scale research on digitization of ancient Japanese books and we are convenient to use them due to open access on the Internet. However, it is a challenging problem to recognize those ancient Japanese books due to the complex background and unsteady shape of Japanese characters, which is called Kuzushiji. In this paper, we proposed a method to segment Japanese characters by using image processing and clustering. Our method is based on the analysis of character characteristics, which could identify the segmentation points more accurately. The validity of the proposed method was confirmed by the evaluation experiment.
KW - Handwritten character segmentation
KW - Japanese historical book
KW - K-means clustering
KW - Kuzushiji
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85184019889&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85184019889&partnerID=8YFLogxK
U2 - 10.24507/icicel.18.02.135
DO - 10.24507/icicel.18.02.135
M3 - Article
AN - SCOPUS:85184019889
SN - 1881-803X
VL - 18
SP - 135
EP - 141
JO - ICIC Express Letters
JF - ICIC Express Letters
IS - 2
ER -