Scolopax: Exploratory Analysis of Scientific Data
Summary: Scolopax is a hypothesis-search engine for exploratory scientific data analysis, with an intuitive UI and scalable parallel data management. It ranks vast hypothesis spaces using model training, summary generation, and novel parallel joins, demonstrated on 3.3M bird sightings across ~2500 attributes. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Alper Okcan
2. Mirek Riedewald
3. Biswanath Panda
4. Daniel Fink
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,074 | Processing Theta-Joins using MapReduce | 2011 | SIGMOD | 0.00014260096 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13,143 | Bridging Disciplines in Data Management Research to Solve Complex Data Problems | 2025 | VLDB | - |
| 4,426 | Data Debugging and Exploration with Vizier | 2019 | SIGMOD | 6.1969994e-05 |
| 14,179 | Scientific Data Management: Real-World Issues and Requirements | 1992 | SIGMOD | - |
| 3,347 | Collaborative Data Analytics with DataHub | 2015 | VLDB | 7.1921364e-05 |
| 9,333 | ShapeSearch: Flexible Pattern-based Querying of Trend Line Visualizations | 2018 | VLDB | 4.3556432e-05 |
| 3,878 | Data Canopy: Accelerating Exploratory Statistical Analysis | 2017 | SIGMOD | 6.6731435e-05 |
| 10,813 | A Demonstration of Polaris: An Interactive and Scalable Data Infrastructure for Polar Science | 2025 | VLDB | 4.1945683e-05 |
| 8,344 | Exploring the Data Wilderness through Examples | 2019 | SIGMOD | 4.5428111e-05 |
| 11,463 | PyExplore: Query Recommendations for Data Exploration without Query Logs | 2021 | SIGMOD | 4.1945683e-05 |
| 1,909 | SciBORQ: Scientific data management with Bounds On Runtime and Quality | 2011 | CIDR | 0.00010121304 |