Database Paper Browser

Back to papers

ConnectorX: Accelerating Data Loading From Databases to Dataframes

Summary: ConnectorX speeds loading from DBMSs to dataframes by reducing client-side overhead—the dominant cost—rather than query execution or data transfer. Server-side result partitioning and a modular design enable easy extension to multiple databases and dataframes, yielding speedups over prior libraries. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12782
Venue
VLDB
Year
2022
Pagerank
5.0216945e-05
Overall Rank
6,541 | 54.50%
DOI
10.14778/3551793.3551847

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 4 of 4 citing papers.

Rank Citing Paper Year Venue Pagerank
10,177 InferF: Declarative Factorization of AI/ML Inferences over Joins 2026 SIGMOD 4.1945683e-05
10,482 Fast and Scalable Data Transfer Across Data Systems 2025 SIGMOD 4.1945683e-05
10,591 Accio: Bolt-on Query Federation 2025 VLDB 4.1945683e-05
10,806 Enter the Warp: Fast and Adaptive Data Transfer with XDBC 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 17 of 17 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
66 Spark SQL: Relational Data Processing in Spark 2015 SIGMOD 0.00061639801
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
1,279 Towards Linear Algebra over Normalized Data 2017 VLDB 0.00012868394
1,377 Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics 2021 CIDR 0.00012296941
1,427 Towards Scalable Dataframe Systems 2020 VLDB 0.0001204248
1,495 Ricardo: Integrating R and Hadoop 2010 SIGMOD 0.00011691049
2,062 Dremel: A Decade of Interactive SQL Analysis at Web Scale 2020 VLDB 9.6481955e-05
2,443 Data Management for Data Science: Towards Embedded Analytics 2020 CIDR 8.8078476e-05
2,804 Extending Relational Query Processing with ML Inference 2020 CIDR 8.0935487e-05
2,934 AIDA - Abstraction for Advanced In-Database Analytics 2018 VLDB 7.8595778e-05
2,954 Magpie: Python at Speed and Scale using Cloud Backends 2021 CIDR 7.8262582e-05
3,099 DB4ML – An In-Memory Database Kernel with Machine Learning Support 2020 SIGMOD 7.5642871e-05
3,958 MLog: Towards Declarative In-Database Machine Learning 2017 VLDB 6.5897636e-05
4,419 Don't Hold My Data Hostage - A Case For Client Protocol Redesign 2017 VLDB 6.2022597e-05
4,813 Putting Pandas in a Box 2021 CIDR 5.9049746e-05
5,964 Bridging Two Worlds with RICE: Integrating R into the SAP In-Memory Computing Engine 2011 VLDB 5.2520617e-05
6,666 Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats 2021 VLDB 4.9691571e-05
Previous Page 1 / 1 Next

Semantically Similar Papers