Auto-Grouping Emails For Faster E-Discovery
Summary: Auto-grouping emails for e-discovery via three modes: syntactic near-duplicate clusters, semantic concept-based groups, and thread-aware segmentation. Enron experiments show reduced review time and high precision/recall; integration into IBM eDiscovery Analyzer. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Sachindra Joshi
- 2. Danish Contractor
- 3. Kenney Ng
- 4. Prasad M Deshpande
- 5. Thomas Hampp
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 616 | Copy Detection Mechanisms for Digital Documents | 1995 | SIGMOD | 0.00019108201 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 9,855 | Progressive Entity Matching: A Design Space Exploration | 2025 | SIGMOD | 4.269353e-05 |
| 12,814 | Type Classification of Semi-Structured Documents | 1995 | VLDB | 4.1945683e-05 |
| 6,684 | Interesting-Phrase Mining for Ad-Hoc Text Analytics | 2010 | VLDB | 4.9629004e-05 |
| 5,379 | Scalable Ad-hoc Entity Extraction from Text Collections | 2008 | VLDB | 5.5405989e-05 |
| 3,485 | Using taxonomy, discriminants, and signatures for navigating in text databases | 1997 | VLDB | 7.0504959e-05 |
| 4,250 | Local Similarity Search for Unstructured Text | 2016 | SIGMOD | 6.3241139e-05 |
| 11,673 | Online Template Induction for Machine-Generated Emails | 2019 | VLDB | 4.1945683e-05 |
| 4,951 | Mining Document Collections to Facilitate Accurate Approximate Entity Matching | 2009 | VLDB | 5.8100413e-05 |
| 3,426 | Discovering Topical Structures of Databases | 2008 | SIGMOD | 7.1063105e-05 |