Abstract
Given a string S of length N on a fixed alphabet of σ symbols, a grammar compressor produces a context-free grammar G of size n that generates S and only S. In this paper we describe data structures to support the following operations on a grammar-compressed string: access(S,i,j) (return substring S[i,j]), rank c (S,i) (return the number of occurrences of symbol c before position i in S), and select c (S,i) (return the position of the ith occurrence of c in S). Our main result for access is a method that requires \O(nlogN) bits of space and \O(logN+m/logσN) time to extract m = j − i + 1 consecutive symbols from S. Alternatively, we can achieve \O(logτN+m/logσN) query time using \O(nτlogτ(N/n)logN) bits of space, matching a lower bound stated by Verbin and Yu for strings where N is polynomially related to n when τ = log ε N. For rank and select we describe data structures of size \O(nσlogN) bits that support the two operations in \O(logN) time. We also extend our other structure to support both operations in \O(logτN) time using \O(nτσlogτ(N/n)logN) bits of space. When τ = log ε N the query time is O(logN/loglogN) and we provide a hardness result showing that significantly improving this would imply a major breakthrough on a hard graph-theoretical problem.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 23rd Annual European Symposium on Algorithms – ESA 2015 |
| Editors | Nikhil Bansal, Irene Finocchi |
| Publisher | Springer |
| Publication date | 2015 |
| Pages | 142-154 |
| ISBN (Print) | 978-3-662-48349-7 |
| ISBN (Electronic) | 978-3-662-48350-3 |
| DOIs | |
| Publication status | Published - 2015 |
| Event | 23rd European Symposium on Algorithms (ESA 2015) - Patras, Greece Duration: 14 Sept 2015 → 16 Sept 2015 Conference number: 23 http://algo2015.upatras.gr/esa/ |
Conference
| Conference | 23rd European Symposium on Algorithms (ESA 2015) |
|---|---|
| Number | 23 |
| Country/Territory | Greece |
| City | Patras |
| Period | 14/09/2015 → 16/09/2015 |
| Internet address |
| Series | Lecture Notes in Computer Science |
|---|---|
| Volume | 9294 |
| ISSN | 0302-9743 |
Fingerprint
Dive into the research topics of 'Access, Rank, and Select in Grammar-compressed Strings'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver