Bit Catastrophes for the Burrows-Wheeler Transform

Sara Giuliani, Shunsuke Inenaga, Zsuzsanna Lipták, Giuseppe Romana, Marinella Sciortino, Cristian Urbina

研究成果: ジャーナルへの寄稿学術誌査読

抄録

A bit catastrophe, loosely defined, is when a change in just one character of a string causes a significant change in the size of the compressed string. We study this phenomenon for the Burrows-Wheeler Transform (BWT), a string transform at the heart of several of the most popular compressors and aligners today. The parameter determining the size of the compressed data is the number of equal-letter runs of the BWT, commonly denoted r. We exhibit infinite families of strings in which insertion, deletion, resp. substitution of one character increases r from constant to Θ(logn), where n is the length of the string. These strings can be interpreted both as examples for an increase by a multiplicative or an additive Θ(logn)-factor. As regards the multiplicative factor, they attain the upper bound given by Akagi, Funakoshi, and Inenaga [Inf & Comput. 2023] of O(lognlogr), since here r=O(1). We then give examples of strings in which insertion, deletion, resp. substitution of a character increases r by a Θ(n) additive factor. These strings significantly improve the best known lower bound for an additive factor of Ω(logn) [Giuliani et al., SOFSEM 2021].

本文言語英語
論文番号19
ジャーナルTheory of Computing Systems
69
2
DOI
出版ステータス出版済み - 6月 2025

!!!All Science Journal Classification (ASJC) codes

  • 理論的コンピュータサイエンス
  • 計算理論と計算数学

フィンガープリント

「Bit Catastrophes for the Burrows-Wheeler Transform」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル