E = MC^3: Managing Uncertain Enterprise Data in a Cluster-Computing Environment
Summary: Extends MCDB to a MapReduce cluster for scalable Monte Carlo analytics over uncertain enterprise data. Introduces distributed seed-generation algorithms, maps MCDB plans to MapReduce over nested data, and supports non-relational storage, with scalable performance. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Fei Xu
- 2. Kevin Beyer
- 3. Vuk Ercegovac
- 4. Peter J. Haas
- 5. Eugene J. Shekita
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,844 | Data is Dead… Without What-If Models | 2011 | VLDB | 5.8813803e-05 |
| 7,787 | Jigsaw: Efficient Optimization Over Uncertain Enterprise Data | 2011 | SIGMOD | 4.6512526e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262 |
| 53 | PNUTS: Yahoo!'s Hosted Data Serving Platform | 2008 | VLDB | 0.00066144767 |
| 299 | Trio: A System for Data, Uncertainty, and Lineage | 2006 | VLDB | 0.00028525071 |
| 321 | MCDB: A Monte Carlo Approach to Managing Uncertain Data | 2008 | SIGMOD | 0.00027527389 |
| 706 | MYSTIQ: A system for finding more answers by using probabilities | 2005 | SIGMOD | 0.00017845469 |
| 980 | BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models | 2008 | VLDB | 0.00014879747 |
| 2,331 | Orion 2.0: Native Support for Uncertain Data | 2008 | SIGMOD | 9.018559e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,657 | Stochastic SketchRefine: Scaling In-Database Decision-Making under Uncertainty to Millions of Tuples | 2025 | VLDB | 4.1945683e-05 |
| 4,689 | Algorithmic Aspects of Parallel Query Processing | 2018 | SIGMOD | 5.9980099e-05 |
| 6,079 | Querying Uncertain Data with Aggregate Constraints | 2011 | SIGMOD | 5.2223439e-05 |
| 6,749 | Evaluation of Probabilistic Threshold Queries in MCDB | 2010 | SIGMOD | 4.9396725e-05 |
| 12,299 | Exceeding Expectations and Clustering Uncertain Data | 2009 | PODS | 4.1945683e-05 |
| 11,929 | Processing of Probabilistic Skyline Queries Using MapReduce | 2015 | VLDB | 4.1945683e-05 |
| 2,186 | Scalable Probabilistic Databases with Factor Graphs and MCMC | 2010 | VLDB | 9.3378109e-05 |
| 12,272 | Conditioning and Aggregating Uncertain Data Streams: Going Beyond Expectations | 2010 | VLDB | 4.1945683e-05 |
| 5,969 | MCDB-R: Risk Analysis in the Database | 2010 | VLDB | 5.2489117e-05 |
| 321 | MCDB: A Monte Carlo Approach to Managing Uncertain Data | 2008 | SIGMOD | 0.00027527389 |