MRCSI: Compressing and Searching String Collections with Multiple References
Summary: MRCSI is a multi-reference compression scheme for collections of dissimilar strings, with automatic reference selection and matching under edit distance. Optimizing MRCSI is NP-hard; three heuristics yield higher compression and a practical trade-off between storage and search efficiency. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Sebastian Wandelt
2. Ulf Leser
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,140 | EntityRank: Searching Entities Directly and Holistically | 2007 | VLDB | 0.00013720706 |
| 6,018 | Relative Lempel-Ziv Factorization for Efficient Storage and Retrieval of Web Collections | 2012 | VLDB | 5.2415551e-05 |
| 12,086 | RCSI: Scalable similarity search in thousand(s) of genomes | 2013 | VLDB | 4.1945683e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,134 | Dictionary-based Order-preserving String Compression for Main Memory Column Stores | 2009 | SIGMOD | 0.00013761456 |
| 5,887 | Efficient Approximate Search on String Collections (Tutorial) | 2009 | VLDB | 5.2879769e-05 |
| 7,777 | Indexing Mixed Types for Approximate Retrieval | 2005 | VLDB | 4.653704e-05 |
| 1,184 | On Effective Multi-Dimensional Indexing for Strings | 2000 | SIGMOD | 0.00013455208 |
| 4,897 | The Wavelet Trie: Maintaining an Indexed Sequence of Strings in Compressed Space | 2012 | PODS | 5.8469152e-05 |
| 6,464 | Reference-Based Indexing of Sequence Databases | 2006 | VLDB | 5.0532607e-05 |
| 9,498 | Memory-Efficient Search Trees for Database Management Systems | 2021 | SIGMOD | 4.3341665e-05 |
| 4,333 | An Efficient Index Structure for String Databases | 2001 | VLDB | 6.2805237e-05 |
| 8,660 | On Searching Compressed String Collections Cache-Obliviously | 2008 | PODS | 4.4722862e-05 |
| 12,086 | RCSI: Scalable similarity search in thousand(s) of genomes | 2013 | VLDB | 4.1945683e-05 |