Automatic Generation of Normalized Relational Schemas from Nested Key-Value Data
Summary: Automates denormalized, nested key-value data into relational schemas. Schema-gen discovers cross-attribute relations; a matching step merges overlapping entity sets, reducing redundancy and enabling efficient relational analytics on NoSQL data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,819 | Mison: A Fast JSON Parser for Data Analytics | 2017 | VLDB | 8.0651326e-05 |
| 4,704 | JSON Tiles: Fast Analytics on Semi-Structured Data | 2021 | SIGMOD | 5.9853687e-05 |
| 6,231 | An LSM-based Tuple Compaction Framework for Apache AsterixDB | 2020 | VLDB | 5.1457863e-05 |
| 7,571 | Reducing Ambiguity in Json Schema Discovery | 2021 | SIGMOD | 4.7075853e-05 |
| 9,851 | Adaptive Schema Databases | 2017 | CIDR | 4.2721228e-05 |
| 10,508 | Synthesizing Third Normal Form Schemata that Minimize Integrity Maintenance and Update Overheads: Parameterizing 3NF by the Numbers of Minimal Keys and Functional Dependencies | 2025 | SIGMOD | 4.1945683e-05 |
| 11,150 | Zed: Leveraging Data Types to Process Eclectic Data | 2023 | CIDR | 4.1945683e-05 |
| 11,189 | dsJSON: A Distributed SQL JSON Processor | 2023 | SIGMOD | 4.1945683e-05 |
| 11,690 | Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology | 2019 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 153 | Relational Databases for Querying XML Documents: Limitations and Opportunities | 1999 | VLDB | 0.00040784455 |
| 207 | Storing Semistructured Data with STORED | 1999 | SIGMOD | 0.00034611968 |
| 224 | CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies | 2004 | SIGMOD | 0.00032746205 |
| 992 | XTRACT: A System for Extracting Document Type Descriptors from XML Documents | 2000 | SIGMOD | 0.00014799689 |
| 2,001 | Sinew: A SQL System for Multi-Structured Data | 2014 | SIGMOD | 9.8186417e-05 |
| 5,108 | ShreX: Managing XML Documents in Relational Databases | 2004 | VLDB | 5.6919861e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,910 | Normalizing Property Graphs | 2023 | VLDB | 5.2768691e-05 |
| 11,575 | JSON Schema Matching: Empirical Observations | 2020 | SIGMOD | 4.1945683e-05 |
| 2,110 | A Recursive Algebra and Query Optimization for Nested Relations | 1989 | SIGMOD | 9.5315487e-05 |
| 5,303 | The Partial Normalized Storage Model of Nested Relations | 1988 | VLDB | 5.5779667e-05 |
| 4,704 | JSON Tiles: Fast Analytics on Semi-Structured Data | 2021 | SIGMOD | 5.9853687e-05 |
| 3,823 | Automatic Discovery of Attributes in Relational Databases | 2011 | SIGMOD | 6.7261168e-05 |
| 1,052 | A Normal Form for Nested Relations | 1985 | PODS | 0.00014436103 |
| 10,765 | Towards Principled, Practical Document Database Design | 2025 | VLDB | 4.1945683e-05 |
| 2,904 | Nested Mappings: Schema Mapping Reloaded | 2006 | VLDB | 7.9355829e-05 |
| 6,180 | The Design of non-1NF Relational Databases into Nested Normal Form | 1987 | SIGMOD | 5.1686632e-05 |