Abstract
We consider compact representations of collections of similar strings that support random access queries. The collection of strings is given by a rooted tree where edges are labeled by an edit operation (inserting, deleting, or replacing a character) and a node represents the string obtained by applying the sequence of edit operations on the path from the root to the node. The goal is to compactly represent the entire collection while supporting fast random access to any part of a string in the collection. This problem captures natural scenarios such as representing the past history of an edited document or representing highly-repetitive collections. Given a tree with n nodes, we show how to represent the corresponding collection in O(n) space and optimal O(log n/log log n) query time. This improves the previous time-space trade-offs for the problem. To obtain our results, we introduce new techniques and ideas, including a reduction to a new geometric line segment selection together with an efficient solution.
Original language | English |
---|---|
Title of host publication | Proceedings of 31st International Symposium on Algorithms and Computation |
Editors | Yixin Cao, Siu-Wing Cheng, Minming Li |
Publisher | Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing |
Publication date | Dec 2020 |
Pages | 481-4816 |
Article number | 48 |
ISBN (Electronic) | 9783959771733 |
DOIs | |
Publication status | Published - Dec 2020 |
Event | 31st International Symposium on Algorithms and Computation - Virtual, Hong Kong, China Duration: 14 Dec 2020 → 18 Dec 2020 Conference number: 31 |
Conference
Conference | 31st International Symposium on Algorithms and Computation |
---|---|
Number | 31 |
Country/Territory | China |
City | Virtual, Hong Kong |
Period | 14/12/2020 → 18/12/2020 |
Series | Leibniz International Proceedings in Informatics, LIPIcs |
---|---|
Volume | 181 |
ISSN | 1868-8969 |
Bibliographical note
Funding Information:Supported by the Danish Research Council (DFF ? 4005-00267, DFF ? 1323-00178).
Publisher Copyright:
© Philip Bille and Inge Li Gørtz.
Keywords
- Data compression
- Data structures
- Persistent strings