Database Paper Browser

Back to papers

Pollock: A Data Loading Benchmark

Summary: Introduces Pollock, a benchmark and formal pollution model to generate realistic non‑standard CSV dialects and structural corruptions based on a survey of real-world files. Uses this framework to evaluate robustness of 16 parsing, DB, spreadsheet and visualization systems. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13043
Venue
VLDB
Year
2023
Pagerank
4.6457732e-05
Overall Rank
7,807 | 45.69%
DOI
10.14778/3594512.3594518

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
2,587 Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks 2024 SIGMOD 8.4924618e-05
5,928 SchemaPile: A Large Collection of Relational Database Schemas 2024 SIGMOD 5.2685946e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers