A Data Transformation System for Biological Data Sources
Summary: Transforms heterogeneous biological data (ASN.1, ACE, BLAST, FASTA) into queryable forms; handles nested, non-relational types (lists, variants) unseen in traditional DBs. Prototype with the Human Genome Center Chromosome 22; highlights querying/transforming bulk biological data and related optimizations. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Peter Buneman
- 2. Susan B. Davidson
- 3. K. Hart
- 4. C. Overton
- 5. Limsoon Wong
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 114 | A Query Language and Optimization Techniques for Unstructured Data | 1996 | SIGMOD | 0.00046339735 |
| 611 | Lineage Tracing for General Data Warehouse Transformations | 2001 | VLDB | 0.00019231115 |
| 1,314 | Semistructured Data | 1997 | PODS | 0.0001263326 |
| 1,384 | Managing Semantic Heterogeneity in Databases : A Theoretical Perspective | 1997 | PODS | 0.00012262892 |
| 1,412 | A Query Language for Multidimensional Arrays: Design, Implementation, and Optimization Techniques | 1996 | SIGMOD | 0.00012122159 |
| 2,730 | Open Data Integration | 2018 | VLDB | 8.2126735e-05 |
| 2,952 | On Wrapping Query Languages and Efficient XML Integration | 2000 | SIGMOD | 7.8300484e-05 |
| 6,407 | Just-In-Time Data Virtualization: Lightweight Data Management with ViDa | 2015 | CIDR | 5.076547e-05 |
| 12,695 | Approximate Query Translation Across Heterogeneous Information Sources | 2000 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 165 | Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases | 1991 | VLDB | 0.00039502525 |
| 550 | Hash-Partitioned Join Method Using Dynamic Destaging Strategy | 1988 | VLDB | 0.00020359891 |
| 1,879 | A Call to Order | 1993 | PODS | 0.00010232242 |
| 2,071 | A New Way to Compute the Product and Join of Relations | 1980 | SIGMOD | 9.6196263e-05 |
| 2,317 | Database Programming in Machiavelli - a Polymorphic Language with Static Type Inference | 1989 | SIGMOD | 9.0409573e-05 |
| 2,426 | Towards an Effective Calculus for Object Query Languages | 1995 | SIGMOD | 8.8375297e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,894 | Building Highly-Optimized, Low-Latency Pipelines for Genomic Data Analysis | 2015 | CIDR | 4.1945683e-05 |
| 5,726 | Biological Data Management: Research, Practice and Opportunities | 2004 | VLDB | 5.3513474e-05 |
| 7,902 | Building Highly-Optimized, Low-Latency Pipelines for Genomic Data Analysis | 2015 | CIDR | 4.6215911e-05 |
| 13,867 | Information Management for Genome Level Bioinformatics | 2001 | VLDB | - |
| 14,202 | Data and Knowledge Bases for Genome Mapping: What Lies Ahead? | 1991 | VLDB | - |
| 12,139 | Massive Genomic Data Processing and Deep Analysis | 2012 | VLDB | 4.1945683e-05 |
| 6,413 | Managing Data from High-Throughput Genomic Processing: A Case Study | 2004 | VLDB | 5.0735389e-05 |
| 12,608 | Genomics Algebra: A New, Integrating Data Model, Language, and Tool for Processing and Querying Genomic Information | 2003 | CIDR | 4.1945683e-05 |
| 13,915 | A Database Platform for Bioinformatics | 2000 | VLDB | - |
| 12,289 | Data Management for High-Throughput Genomics | 2009 | CIDR | 4.1945683e-05 |