TY - GEN
T1 - Discovering characteristic patterns from collections of classical Japanese poems
AU - Yamasaki, Mayumi
AU - Takeda, Masayuki
AU - Fukuda, Tomoko
AU - Nanri, Ichirō
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 1998.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 1998
Y1 - 1998
N2 - Waka is a form of traditional Japanese poetry with a 1300- year history. In this paper, we attempt to discover characteristics common to a collection of waka poems. As a formalism for characteristics, we use regular patterns where the constant parts are limited to sequences of auxiliary verbs and postpositional particles. We call such patterns fushi. The problem is to find automatically significant fushi patterns that characterize the poems. Solving this problem requires a reliable significance measure for the patterns. Brāzma et al. (1996) proposed such a measure according to the MDL principle. Using this method, we report successful results in finding patterns from five anthologies. Some of the results are quite stimulating, and we hope that they will lead to new discoveries. Based on our experience, we also propose a pattern-based text data mining system. Further research into waka poetry is now proceeding using this system.
AB - Waka is a form of traditional Japanese poetry with a 1300- year history. In this paper, we attempt to discover characteristics common to a collection of waka poems. As a formalism for characteristics, we use regular patterns where the constant parts are limited to sequences of auxiliary verbs and postpositional particles. We call such patterns fushi. The problem is to find automatically significant fushi patterns that characterize the poems. Solving this problem requires a reliable significance measure for the patterns. Brāzma et al. (1996) proposed such a measure according to the MDL principle. Using this method, we report successful results in finding patterns from five anthologies. Some of the results are quite stimulating, and we hope that they will lead to new discoveries. Based on our experience, we also propose a pattern-based text data mining system. Further research into waka poetry is now proceeding using this system.
UR - http://www.scopus.com/inward/record.url?scp=84949208217&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84949208217&partnerID=8YFLogxK
U2 - 10.1007/3-540-49292-5_12
DO - 10.1007/3-540-49292-5_12
M3 - Conference contribution
AN - SCOPUS:84949208217
SN - 3540653902
SN - 9783540653905
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 129
EP - 141
BT - Discovery Science - 1st International Conference, DS 1998, Proceedings
A2 - Arikawa, Setsuo
A2 - Motoda, Hiroshi
PB - Springer Verlag
T2 - 1st International Conference on Discovery Science, DS 1998
Y2 - 14 December 1998 through 16 December 1998
ER -