Goods: Organizing Google's Datasets
Summary: Goods crawls diverse enterprise datasets to build a scalable metadata catalog and infers relationships (such as similarity and provenance) among billions of datasets in a distributed landscape. It provides discovery, monitoring, annotation, and relationship-analysis services for enterprise data at scale. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Alon Halevy
2. Flip Korn
3. Natalya F. Noy
4. Christopher Olston
5. Neoklis Polyzotis
6. Sudip Roy
7. Steven Euijong Whang
Incoming Citations (Sorted by Pagerank)
Showing 44 of 44 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 109 | Dremel: Interactive Analysis of Web-Scale Datasets | 2010 | VLDB | 0.00048186983 |
| 149 | Trio: A System for Integrated Management of Data, Accuracy, and Lineage | 2005 | CIDR | 0.00041101118 |
| 420 | InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables | 2012 | SIGMOD | 0.00023719065 |
| 1,281 | DataHub: Collaborative Data Science & Dataset Version Management at Scale | 2015 | CIDR | 0.00012854744 |
| 1,565 | Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff | 2015 | VLDB | 0.00011345567 |
| 1,833 | Data Wrangling: The Challenging Journey from the Wild to the Lake | 2015 | CIDR | 0.00010378976 |
| 3,347 | Collaborative Data Analytics with DataHub | 2015 | VLDB | 7.1921364e-05 |
| 8,135 | Applying WebTables in Practice | 2015 | CIDR | 4.5777549e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,358 | Organizing Data Lakes for Navigation | 2020 | SIGMOD | 7.1784949e-05 |
| 10,419 | Unified Lineage System: Tracking Data Provenance at Scale | 2025 | SIGMOD | 4.1945683e-05 |
| 10,439 | Finding What You’re Looking For: A Distribution-Aware Dataset Search Engine in Action | 2025 | SIGMOD | 4.1945683e-05 |
| 11,319 | Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration | 2022 | CIDR | 4.1945683e-05 |
| 11,629 | Leveraging Organizational Resources to Adapt Models to New Data Modalities | 2020 | VLDB | 4.1945683e-05 |
| 6,279 | Self-Organizing Data Containers | 2022 | CIDR | 5.1295282e-05 |
| 3,551 | Data Management Projects at Google | 2006 | SIGMOD | 6.9812665e-05 |
| 12,286 | The Case for a Structured Approach to Managing Unstructured Data | 2009 | CIDR | 4.1945683e-05 |
| 10,895 | Towards an Objective Metric for Data Value Through Relevance | 2024 | CIDR | 4.1945683e-05 |
| 4,530 | Big Metadata: When Metadata is Big Data | 2021 | VLDB | 6.1075429e-05 |