Optimally Computing Compressed Indexing Arrays Based on the Compact Directed Acyclic Word Graph

Hiroki Arimura, Shunsuke Inenaga, Yasuaki Kobayashi, Yuto Nakashima, Mizuki Sue

研究成果: 書籍/レポート タイプへの寄稿会議への寄与

2 被引用数 (Scopus)

抄録

In this paper, we present the first study of the computational complexity of converting an automata-based text index structure, called the Compact Directed Acyclic Word Graph (CDAWG), of size e for a text T of length n into other text indexing structures for the same text, suitable for highly repetitive texts: the run-length BWT of size r, the irreducible PLCP array of size r, and the quasi-irreducible LPF array of size e, as well as the lex-parse of size O(r) and the LZ77-parse of size z, where $$r, z \leqslant e$$. As main results, we showed that the above structures can be optimally computed from either the CDAWG for T stored in read-only memory or its self-index version of size e without a text in O(e) worst-case time and words of working space. To obtain the above results, we devised techniques for enumerating a particular subset of suffixes in the lexicographic and text orders using the forward and backward search on the CDAWG by extending the result by Belazzougui et al. in 2015.

本文言語英語
ホスト出版物のタイトルString Processing and Information Retrieval - 30th International Symposium, SPIRE 2023, Proceedings
編集者Franco Maria Nardini, Nadia Pisanti, Rossano Venturini
出版社Springer Science and Business Media Deutschland GmbH
ページ28-34
ページ数7
ISBN(印刷版)9783031439797
DOI
出版ステータス出版済み - 2023
イベント30th International Symposium on String Processing and Information Retrieval, SPIRE 2023 - Pisa, イタリア
継続期間: 9月 26 20239月 28 2023

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14240 LNCS
ISSN(印刷版)0302-9743
ISSN(電子版)1611-3349

会議

会議30th International Symposium on String Processing and Information Retrieval, SPIRE 2023
国/地域イタリア
CityPisa
Period9/26/239/28/23

!!!All Science Journal Classification (ASJC) codes

  • 理論的コンピュータサイエンス
  • コンピュータサイエンス一般

フィンガープリント

「Optimally Computing Compressed Indexing Arrays Based on the Compact Directed Acyclic Word Graph」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル