Scalable Regular Expression Matching on Data Streams
Summary: The paper presents an end-to-end, high-performance approach to regular expression (RE) matching on data streams that combines the throughput of DFAs with the space efficiency of NFAs. It rests on two core ideas: caching the frequent DFA states, and clustering interacting REs to cap state blowup. Together these enable scalable matching under tight memory and outperform a leading intrusion detection system (IDS). (summarized by gpt-5-nano on Feb 09 2026)
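To make the caching idea in the summary concrete, here is a minimal sketch in Python. It is not the authors' implementation: it assumes a hand-built, epsilon-free NFA for the pattern `.*ab`, treats each DFA state as a set of NFA states, and keeps only a bounded number of DFA transitions in an LRU cache, falling back to the NFA on a miss. The names `CachedMatcher`, `nfa_delta`, and `cache_size` are hypothetical, and the RE clustering step is not shown.

```python
from collections import OrderedDict


class CachedMatcher:
    """NFA simulation with a bounded LRU cache of DFA transitions (illustrative only)."""

    def __init__(self, nfa_delta, start, accepting, cache_size=4):
        self.nfa_delta = nfa_delta            # {(nfa_state, symbol): set of nfa_states}
        self.start = frozenset([start])       # a DFA state is a set of NFA states
        self.accepting = frozenset(accepting)
        self.cache_size = cache_size
        self.cache = OrderedDict()            # {(dfa_state, symbol): dfa_state}
        self.hits = 0
        self.misses = 0

    def _step(self, dfa_state, symbol):
        key = (dfa_state, symbol)
        if key in self.cache:
            # Fast path: the transition was computed before and is still cached.
            self.cache.move_to_end(key)
            self.hits += 1
            return self.cache[key]
        # Slow path: compute the successor set directly from the NFA.
        self.misses += 1
        successors = set()
        for q in dfa_state:
            successors |= self.nfa_delta.get((q, symbol), set())
        nxt = frozenset(successors)
        self.cache[key] = nxt
        if len(self.cache) > self.cache_size:
            self.cache.popitem(last=False)    # evict the least recently used entry
        return nxt

    def match_positions(self, stream):
        """Return the indices in the stream at which a match ends."""
        state = self.start
        positions = []
        for i, symbol in enumerate(stream):
            state = self._step(state, symbol)
            if state & self.accepting:
                positions.append(i)
        return positions


# Hypothetical NFA for ".*ab" over the alphabet {a, b, c}:
# state 0 loops on every symbol, state 1 means "just saw a", state 2 accepts.
nfa_delta = {
    (0, "a"): {0, 1},
    (0, "b"): {0},
    (0, "c"): {0},
    (1, "b"): {2},
}

matcher = CachedMatcher(nfa_delta, start=0, accepting={2}, cache_size=4)
positions = matcher.match_positions("cabab")
print(positions, matcher.hits, matcher.misses)
```

The cache bound caps memory regardless of how many DFA states the pattern could generate. A smaller `cache_size` gives the same matches, only with more slow-path NFA steps.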
Authors
1. Anirban Majumder
2. Rajeev Rastogi
3. Sriram Vanama
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,418 | ZStream: A Cost-based Query Processor for Adaptively Detecting Composite Events | 2009 | SIGMOD | 0.00012089363 |
| 1,429 | A Scalable, Predictable Join Operator for Highly Concurrent Data Warehouses | 2009 | VLDB | 0.00012033518 |
| 3,815 | High-Performance Dynamic Pattern Matching over Disordered Streams | 2010 | VLDB | 6.7333316e-05 |
| 6,351 | SigMatch: Fast and Scalable Multi-Pattern Matching | 2010 | VLDB | 5.1005697e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 622 | Towards an Internet-Scale XML Dissemination Service | 2004 | VLDB | 0.00019000333 |
| 3,526 | RE-Tree: An Efficient Index Structure for Regular Expressions | 2002 | VLDB | 7.0078308e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,738 | AFilter: Adaptable XML Filtering with Prefix-Caching and Suffix-Clustering | 2006 | VLDB | 4.6636747e-05 |
| 391 | Indexing and Querying XML Data for Regular Path Expressions | 2001 | VLDB | 0.00024564567 |
| 3,526 | RE-Tree: An Efficient Index Structure for Regular Expressions | 2002 | VLDB | 7.0078308e-05 |
| 5,031 | Event Pattern Matching over Graph Streams | 2015 | VLDB | 5.7499783e-05 |
| 9,157 | REmatch: a novel regex engine for finding all matches | 2023 | VLDB | 4.3849295e-05 |
| 5,404 | Practical Authenticated Pattern Matching with Optimal Proof Size | 2015 | VLDB | 5.5267144e-05 |
| 12,042 | E-Matching: Event Processing over Noisy Sequences in Real Time | 2013 | SIGMOD | 4.1945683e-05 |
| 3,815 | High-Performance Dynamic Pattern Matching over Disordered Streams | 2010 | VLDB | 6.7333316e-05 |
| 12,102 | Deterministic Regular Expressions in Linear Time | 2012 | PODS | 4.1945683e-05 |
| 776 | Efficient Pattern Matching over Event Streams | 2008 | SIGMOD | 0.00016799754 |