TY - GEN
T1 - On Sensitivity of Compact Directed Acyclic Word Graphs
AU - Fujimaru, Hiroto
AU - Nakashima, Yuto
AU - Inenaga, Shunsuke
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Compact directed acyclic word graphs (CDAWGs) [Blumer et al. 1987] are a fundamental data structure on strings with applications in text pattern searching, data compression, and pattern discovery. Intuitively, the CDAWG of a string T is obtained by merging isomorphic subtrees of the suffix tree [Weiner 1973] of the same string T, thus CDAWGs are a compact indexing structure. In this paper, we investigate the sensitivity of CDAWGs when a single character edit operation (insertion, deletion, or substitution) is performed at the left-end of the input string T, namely, we are interested in the worst-case increase in the size of the CDAWG after a left-end edit operation. We prove that if e is the number of edges of the CDAWG for string T, then the number of new edges added to the CDAWG after a left-end edit operation on T is less than e. Further, we present almost matching lower bounds on the sensitivity of CDAWGs for all cases of insertion, deletion, and substitution.
AB - Compact directed acyclic word graphs (CDAWGs) [Blumer et al. 1987] are a fundamental data structure on strings with applications in text pattern searching, data compression, and pattern discovery. Intuitively, the CDAWG of a string T is obtained by merging isomorphic subtrees of the suffix tree [Weiner 1973] of the same string T, thus CDAWGs are a compact indexing structure. In this paper, we investigate the sensitivity of CDAWGs when a single character edit operation (insertion, deletion, or substitution) is performed at the left-end of the input string T, namely, we are interested in the worst-case increase in the size of the CDAWG after a left-end edit operation. We prove that if e is the number of edges of the CDAWG for string T, then the number of new edges added to the CDAWG after a left-end edit operation on T is less than e. Further, we present almost matching lower bounds on the sensitivity of CDAWGs for all cases of insertion, deletion, and substitution.
UR - http://www.scopus.com/inward/record.url?scp=85173572775&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173572775&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-33180-0_13
DO - 10.1007/978-3-031-33180-0_13
M3 - Conference contribution
AN - SCOPUS:85173572775
SN - 9783031331794
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 168
EP - 180
BT - Combinatorics on Words - 14th International Conference, WORDS 2023, Proceedings
A2 - Frid, Anna
A2 - Mercaş, Robert
PB - Springer Science and Business Media Deutschland GmbH
T2 - 14th International Conference on Combinatorics on Words, WORDS 2023
Y2 - 12 June 2023 through 16 June 2023
ER -