SpareLLM: Automatically Selecting Task-Specific Minimum-Cost Large Language Models under Equivalence Constraint
Summary: SpareLLM selects task-specific, minimum-cost LLMs with an output-equivalence constraint. Profiling-first approach and heterogeneous model cascades yield Pareto-optimal cost-accuracy tradeoffs, up to 8.6x savings and 90% equivalence to GPT-4-Turbo. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Saehan Jo
- 2. Immanuel Trummer
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,143 | Beluga: A CXL-Based Memory Architecture for Scalable and Efficient LLM KVCache Management | 2026 | SIGMOD | 4.1945683e-05 |
| 10,194 | PRISM: Navigating Cost–Accuracy Trade-offs for NL2SQL | 2026 | SIGMOD | 4.1945683e-05 |
| 10,215 | Task Cascades for Efficient Unstructured Data Processing | 2026 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,152 | Blink and It's Done: Interactive Queries on Very Large Data | 2012 | VLDB | 0.00013645792 |
| 1,204 | VerdictDB: Universalizing Approximate Query Processing | 2018 | SIGMOD | 0.00013319541 |
| 1,323 | Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters | 2016 | SIGMOD | 0.00012601997 |
| 1,574 | Approximate Query Processing: No Silver Bullet | 2017 | SIGMOD | 0.00011287495 |
| 2,011 | Rapid Sampling for Visualizations with Ordering Guarantees | 2015 | VLDB | 9.7964875e-05 |
| 2,995 | A Sampling Algebra for Aggregate Estimation | 2013 | VLDB | 7.7587199e-05 |
| 11,552 | BitGourmet: Deterministic Approximation via Optimized Bit Selection | 2020 | CIDR | 4.1945683e-05 |
Previous
Page 1 / 1
Next