Database Paper Browser

Back to papers

Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights

Summary: Compression-based direct processing for document analytics on compressed data via Sequitur's hierarchical grammars. Guidelines and modules to enable practice; experiments show 90.8% storage savings, 77.5% memory savings, and 1.6x (sequential) to 2.2x (distributed) speedups. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11639
Venue
VLDB
Year
2018
Pagerank
6.1073703e-05
Overall Rank
4,531 | 68.48%
DOI
10.14778/3236187.3236203

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 12 of 12 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
4 Pregel: A System for Large-Scale Graph Processing 2010 SIGMOD 0.0019005923
193 On Supporting Containment Queries in Relational Database Management Systems 2001 SIGMOD 0.00035610321
459 Processing Analytical Queries over Encrypted Data 2013 VLDB 0.00022627746
Previous Page 1 / 1 Next

Semantically Similar Papers