LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation
Summary: LLMLog generates log templates through multi-round, LLM-driven annotation. It picks k logs to annotate using edit-distance similarity and informativeness sampling, where informativeness combines representativeness with LLM confidence. Adaptive in-context example selection ensures keyword coverage for each unlabeled log, which improves template accuracy; the method outperforms prior approaches across 16 datasets.
(summarized by gpt-5-mini on Feb 09 2026)
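The two ingredients named in the summary, edit-distance similarity between logs and in-context example selection that covers an unlabeled log's keywords, can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names `edit_distance`, `similarity`, and `select_examples` are hypothetical, logs are assumed to be tokenized on whitespace, every token is treated as a keyword, and coverage is approximated with a simple greedy loop.

```python
from typing import List, Set


def edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(b) + 1))
    for i, tok_a in enumerate(a, start=1):
        curr = [i]
        for j, tok_b in enumerate(b, start=1):
            cost = 0 if tok_a == tok_b else 1
            curr.append(min(prev[j] + 1,          # delete tok_a
                            curr[j - 1] + 1,      # insert tok_b
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def similarity(log_a: str, log_b: str) -> float:
    """Normalized token-level similarity in [0, 1]; 1.0 means identical."""
    a, b = log_a.split(), log_b.split()
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


def select_examples(target: str, labeled: List[str], budget: int) -> List[str]:
    """Greedily pick labeled logs that cover the target's tokens.

    Assumption: every whitespace token counts as a keyword. Ties in
    coverage are broken by similarity to the target log.
    """
    uncovered: Set[str] = set(target.split())
    remaining = list(labeled)
    chosen: List[str] = []
    while uncovered and remaining and len(chosen) < budget:
        best = max(
            remaining,
            key=lambda log: (len(uncovered & set(log.split())),
                             similarity(target, log)),
        )
        if not uncovered & set(best.split()):
            break  # no remaining candidate covers anything new
        chosen.append(best)
        remaining.remove(best)
        uncovered -= set(best.split())
    return chosen


if __name__ == "__main__":
    pool = [
        "Connection from 10.0.0.2 opened",
        "Session 42 closed",
        "Disk usage at 91%",
    ]
    print(select_examples("Connection from 10.0.0.1 closed", pool, budget=2))
```

In this toy run the first two pool entries are selected, because together they cover `Connection`, `from`, and `closed`; the variable token `10.0.0.1` stays uncovered, which is the kind of position a template would replace with a wildcard.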
- Paper ID: 13948
- Venue: VLDB
- Year: 2025
- Pagerank: 4.1945683e-05
- Overall Rank: 10,658 | 25.86%
- DOI: 10.14778/3746405.3746433
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 30 of 30 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
| 369 | Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation | 2024 | VLDB | 0.0002547515 |
| 690 | An Analytical Study of Large SPARQL Query Logs | 2018 | VLDB | 0.00018099792 |
| 1,373 | Aether: A Scalable Approach to Logging | 2010 | VLDB | 0.00012309902 |
| 2,517 | Annotating Columns with Pre-trained Language Models | 2022 | SIGMOD | 8.6092139e-05 |
| 3,472 | LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | 2025 | VLDB | 7.0639229e-05 |
| 3,508 | SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines | 2024 | VLDB | 7.0271496e-05 |
| 3,592 | Query Fresh: Log Shipping on Steroids | 2018 | VLDB | 6.9406675e-05 |
| 3,995 | How Large Language Models Will Disrupt Data Management | 2023 | VLDB | 6.5513237e-05 |
| 4,204 | Maestro: Automatic Generation of Comprehensive Benchmarks for Question Answering Over Knowledge Graphs | 2023 | SIGMOD | 6.3607478e-05 |
| 4,328 | How to Build Templates for RDF Question/Answering: An Uncertain Graph Similarity Join Approach | 2015 | SIGMOD | 6.2866586e-05 |
| 5,059 | High-Performance Row Pattern Recognition Using Joins | 2023 | VLDB | 5.7277656e-05 |
| 5,099 | ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | 2024 | VLDB | 5.6997784e-05 |
| 5,517 | Representing Paths in Graph Database Pattern Matching | 2023 | VLDB | 5.4626107e-05 |
| 5,706 | TencentCLS: The Cloud Log Service with High Query Performances | 2022 | VLDB | 5.3611566e-05 |
| 6,151 | An Efficient Transfer Learning Based Configuration Adviser for Database Tuning | 2024 | VLDB | 5.183652e-05 |
| 6,389 | Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs | 2024 | VLDB | 5.0844009e-05 |
| 7,020 | LLM for Data Management | 2024 | VLDB | 4.8595728e-05 |
| 7,317 | Accurate and Fast Approximate Graph Pattern Mining at Scale | 2025 | VLDB | 4.7639399e-05 |
| 7,430 | Adaptive Log Compression for Massive Log Data | 2013 | SIGMOD | 4.7317713e-05 |
| 7,838 | Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes | 2021 | SIGMOD | 4.6377995e-05 |
| 8,175 | Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models | 2025 | VLDB | 4.5676289e-05 |
| 8,182 | SHiFT: An Efficient, Flexible Search Engine for Transfer Learning | 2023 | VLDB | 4.5659133e-05 |
| 8,490 | A Framework for Privacy Preserving Localized Graph Pattern Query Processing | 2023 | SIGMOD | 4.499438e-05 |
| 8,579 | RECA: Related Tables Enhanced Column Semantic Type Annotation Framework | 2023 | VLDB | 4.4922446e-05 |
| 9,871 | From Logs to Causal Inference: Diagnosing Large Systems | 2025 | VLDB | 4.2667743e-05 |
| 9,872 | Substructure-aware Log Anomaly Detection | 2025 | VLDB | 4.2667743e-05 |
| 9,873 | CORAL: Collaborative Automatic Labeling System based on Large Language Models | 2024 | VLDB | 4.2667743e-05 |
| 9,874 | HSAP: A Human-in-the-loop Social Media-based Situation Awareness Platform | 2024 | VLDB | 4.2667743e-05 |
| 9,875 | A Universal Question-Answering Platform for Knowledge Graphs | 2023 | SIGMOD | 4.2667743e-05 |
| 9,876 | Near-Duplicate Sequence Search at Scale for Large Language Model Memorization Evaluation | 2023 | SIGMOD | 4.2667743e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
| 9,872 | Substructure-aware Log Anomaly Detection | 2025 | VLDB | 4.2667743e-05 |
| 1,116 | Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes | 2024 | VLDB | 0.00013890154 |
| 10,452 | ScaleLLM: A Technique for Scalable LLM-augmented Data Systems | 2025 | SIGMOD | 4.1945683e-05 |
| 10,316 | LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning | 2026 | VLDB | 4.1945683e-05 |
| 10,064 | Cut Costs, Not Accuracy: LLM-Powered Data Processing with Guarantees | 2026 | SIGMOD | 4.1945683e-05 |
| 10,022 | In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration | 2026 | SIGMOD | 4.1945683e-05 |
| 10,713 | CoLA: Model Collaboration for Log-based Anomaly Detection | 2025 | VLDB | 4.1945683e-05 |
| 7,020 | LLM for Data Management | 2024 | VLDB | 4.8595728e-05 |
| 4,154 | Robust and Transferable Log-based Anomaly Detection | 2023 | SIGMOD | 6.4032498e-05 |
| 6,897 | PreLog: A Pre-trained Model for Log Analytics | 2024 | SIGMOD | 4.8925595e-05 |