Abstract
We present an index for top-k most frequent document retrieval whose space is $|CSA| + o(n) + D\log\frac{n}{D} + O(D)$ bits, and its query time is $O(\log k \log^{2+\varepsilon} n)$ per reported document, where $D$ is the number of documents, $n$ is the sum of lengths of the documents, and $|CSA|$ is the space of the compressed suffix array for the documents. This improves over previous results for this problem, whose space complexities are $|CSA| + \omega(n)$ or $2|CSA| + \omega(1)$.
| Original language | American English |
|---|---|
| Pages (from-to) | 440-443 |
| Number of pages | 4 |
| Journal | Information Processing Letters |
| Volume | 113 |
| Issue number | 12 |
| State | Published - 17 Apr 2013 |
Keywords
- Data structures
- Document retrieval
- Text indexing
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Signal Processing
- Information Systems
- Computer Science Applications