COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics

Summary: Proposes Compare, a relational operator for concise enumeration and cross-subset comparison. Optimizations exploiting Compare semantics, implemented in SQL Server, yield speedups over standard plans, UDFs, and middleware for large-scale analytics. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 12419
Venue: VLDB
Year: 2021
Pagerank: 4.2680295e-05
Overall Rank: 9,850 | 31.55%
DOI: 10.14778/3476249.3476291

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
9,992	Leveraging Query Optimizers to Verify the Soundness of LLM-based Query Rewrites for Real-World Workloads, and More!	2026	CIDR	4.1905499e-05
10,879	SDEcho: Efficient Explanation of Aggregated Sequence Difference	2025	VLDB	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 24 of 24 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
125	Approximate String Joins in a Database (Almost) for Free	2001	VLDB	0.00044946098
247	On the Computation of Multidimensional Aggregates	1996	VLDB	0.00030923842
248	Efficient set joins on similarity predicates	2004	SIGMOD	0.00030888982
264	Efficient Exact Set-Similarity Joins	2006	VLDB	0.00029950264
362	Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases	1995	VLDB	0.00025758421
461	SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics	2015	VLDB	0.00022615628
659	The Making of TPC-DS	2006	VLDB	0.00018514913
762	Explaining differences in multidimensional aggregates	1999	VLDB	0.00016984985
992	Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System	2017	VLDB	0.00014795333
999	Similarity-Based Queries for Time Series Data	1997	SIGMOD	0.00014726031
1,043	Set Containment Joins: The Good, The Bad and The Ugly	2000	VLDB	0.00014468616
1,059	Answering Complex SQL Queries Using Automatic Summary Tables	2000	SIGMOD	0.00014370009
1,205	PIVOT and UNPIVOT: Optimization and Execution Strategies in an RDBMS	2004	VLDB	0.00013309393
1,754	Querying Multiple Features of Groups in Relational Databases	1996	VLDB	0.00010663123
2,158	DIFF: A Relational Interface for Large-Scale Data Explanation	2019	VLDB	9.4117885e-05
2,614	i3: Intelligent, Interactive Investigation of OLAP data cubes	2000	SIGMOD	8.4538942e-05
2,738	The Case for Data Visualization Management Systems [Vision Paper]	2014	VLDB	8.2039241e-05
2,939	AIDA - Abstraction for Advanced In-Database Analytics	2018	VLDB	7.8514162e-05
3,678	Providing Better Support for a Class of Decision Support Queries	1996	SIGMOD	6.8482668e-05
4,679	Adaptive Sampling for Rapidly Matching Histograms	2018	VLDB	5.9978792e-05
5,193	QuickInsights: Quick and Automatic Discovery of Insights from Multi-Dimensional Data	2019	SIGMOD	5.6322426e-05
5,742	Efficient Computation of Multiple Group By Queries	2005	SIGMOD	5.3432178e-05
6,840	Towards Democratizing Relational Data Visualization	2019	SIGMOD	4.9058387e-05
9,338	ShapeSearch: Flexible Pattern-based Querying of Trend Line Visualizations	2018	VLDB	4.351469e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
1,581	Execution Strategies for SQL Subqueries	2007	SIGMOD	0.00011262795
5,742	Efficient Computation of Multiple Group By Queries	2005	SIGMOD	5.3432178e-05
5,728	Conjunctive Queries with Comparisons	2022	SIGMOD	5.350072e-05
1,475	Efficient Exploitation of Similar Subexpressions for Query Processing	2007	SIGMOD	0.00011765071
10,973	Relational Algorithms for Top-k Query Evaluation	2024	SIGMOD	4.1905499e-05
638	Towards a Unified Architecture for in-RDBMS Analytics	2012	SIGMOD	0.00018810785
1,952	Groupwise Processing of Relational Queries	1997	VLDB	9.9806285e-05
5,078	Combi-Operator – Database Support for Data Mining Applications	2003	VLDB	5.708568e-05
7,161	Computing the Difference of Conjunctive Queries Efficiently	2023	SIGMOD	4.8086254e-05
1,754	Querying Multiple Features of Groups in Relational Databases	1996	VLDB	0.00010663123