Database Paper Browser

Back to papers

Online Maintenance of Very Large Random Samples on Flash Storage

Summary: Introduces B-FILE, a flash-centric abstraction for self-expiring items to maintain large samples from streams. Shows reservoir/Geometric File ill-suited for flash; enables biased sampling, fast subsamples, and semi-random writes that beat random writes. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
9729
Venue
VLDB
Year
2008
Pagerank
0.00011806921
Overall Rank
1,475 | 89.75%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 14 of 14 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 6 of 6 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
345 Design of Flash-Based DBMS: An In-Page Logging Approach 2007 SIGMOD 0.00026677681
783 Random Sampling from Hash Files 1990 SIGMOD 0.00016704834
1,260 Dynamic Sample Selection for Approximate Query Processing 2003 SIGMOD 0.00012993347
2,368 Online Maintenance of Very Large Random Samples 2004 SIGMOD 8.9501526e-05
2,396 A Novel Index Supporting High Volume Data Warehouse Insertions 1999 VLDB 8.8997169e-05
4,500 Rethinking Data Management for Storage-centric Sensor Networks 2007 CIDR 6.1381791e-05
Previous Page 1 / 1 Next

Semantically Similar Papers