Abstract
We study a new variant of the pattern matching problem called cross-document pattern matching, which is the problem of indexing a collection of documents to support an efficient search for a pattern in a selected document, where the pattern itself is a substring of another document. Several variants of this problem are considered, and efficient linear space solutions are proposed with query time bounds that either do not depend at all on the pattern size or depend on it in a very limited way (doubly logarithmic). As a side result, we propose an improved solution to the weighted ancestor problem.
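To make the query concrete, the following is a minimal brute-force sketch of what a cross-document query asks for. It is not the linear-space index proposed in the paper: it scans the target document directly, so its running time depends on both the pattern and the document length. The function name `cross_document_occurrences` and the parameters `k`, `i`, `j`, `l` are hypothetical names chosen for illustration.

```python
from typing import List


def cross_document_occurrences(
    docs: List[str], k: int, i: int, j: int, l: int
) -> List[int]:
    """Naive baseline (an assumption for illustration, not the paper's method).

    The pattern is the substring docs[k][i:j]; the result is the list of
    start positions at which that pattern occurs in docs[l].
    """
    if not (0 <= i < j <= len(docs[k])):
        raise ValueError("pattern must be a non-empty substring of docs[k]")

    pattern = docs[k][i:j]
    target = docs[l]

    positions = []
    start = target.find(pattern)
    while start != -1:
        positions.append(start)
        # Advance by one so overlapping occurrences are also reported.
        start = target.find(pattern, start + 1)
    return positions


docs = ["abracadabra", "cadabra", "xyz"]
# Pattern "abra" taken from document 0, searched in document 1.
print(cross_document_occurrences(docs, 0, 0, 4, 1))
```

Note that the pattern is passed as a pair of positions inside an indexed document rather than as a string. An index can therefore avoid reading the pattern at all, which is why query times that barely depend on the pattern length are possible.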
Original language | English |
---|---|
Pages (from-to) | 40-47 |
Number of pages | 8 |
Journal | Journal of Discrete Algorithms |
Volume | 24 |
State | Published - Jan 2014 |
Externally published | Yes |
Keywords
- Algorithms
- Document reporting
- Pattern matching
- Weighted ancestor problem
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Discrete Mathematics and Combinatorics
- Computational Theory and Mathematics