This is a temporary home for the Stack dataset. This dataset was first introduced with Bao, a learned query optimizer. After completing a large potion of this project at MIT before COVID. A few weeks ago, I finally got access to my lab machines again, so please be patient as I upload the data and query workloads. :)
You can download the PostgreSQL archive from these link:
We are also making available a set of 5000+ queries that can be executed against the data. These queries are either taken from StackOverflow analytics dashboards (possibly recreated from them), or specifically generated to strain a query optimizer. These queries are made available under the terms of the MIT license.