GeoDeepDive: Statistical Inference using Familiar Data-Processing Languages
Summary: GeoDeepDive shows end-to-end statistical inference over geology articles (text, tables, figures), addressing acquisition, extraction, and integration in a DB-like workflow. It uniquely supports feature engineering in SQL or Python and analyzes feedback from expert labeling, distant supervision, rules, and crowdsourcing within a traditional database-ML setting. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ce Zhang
- 2. Vidhya Govindaraju
- 3. Christopher RĂ©
- 4. Jackson Borchardt
- 5. Shanan Peters
- 6. Tim Foltz
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,164 | SlimShot: In-Database Probabilistic Inference for Knowledge Bases | 2016 | VLDB | 6.3923099e-05 |
| 11,718 | A Demonstration of Sya: A Spatial Probabilistic Knowledge Base Construction System | 2018 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404 |
| 1,014 | Tuffy: Scaling up Statistical Inference in Markov Logic Networks using an RDBMS | 2011 | VLDB | 0.00014640258 |
| 4,077 | Towards High-Throughput Gibbs Sampling at Scale: A Study across Storage Managers | 2013 | SIGMOD | 6.4678697e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,811 | In-database Distributed Machine Learning: Demonstration using Teradata SQL Engine | 2019 | VLDB | 4.9200998e-05 |
| 3,335 | DeepJoin: Joinable Table Discovery with Pre-trained Language Models | 2023 | VLDB | 7.2065006e-05 |
| 9,436 | Transforming ML Predictive Pipelines into SQL with MASQ | 2021 | SIGMOD | 4.3430376e-05 |
| 4,906 | Machine Learning for Big Data | 2013 | SIGMOD | 5.8389053e-05 |
| 13,294 | Demonstration of ModelarDB: Model-Based Management of Dimensional Time Series | 2019 | SIGMOD | - |
| 8,633 | Demonstration: MacroBase, A Fast Data Analysis Engine | 2017 | SIGMOD | 4.4802036e-05 |
| 8,748 | Databases as Graphs: Predictive Queries for Declarative Machine Learning | 2023 | PODS | 4.456315e-05 |
| 3,635 | A Deep Dive into Deep Learning Approaches for Text-to-SQL Systems | 2021 | SIGMOD | 6.8981006e-05 |
| 13,244 | Deep Data Integration | 2021 | SIGMOD | - |
| 4,106 | Extracting Databases from Dark Data with DeepDive | 2016 | SIGMOD | 6.4456184e-05 |