Latent Semantic Indexing: A Probabilistic Analysis
Summary: Gives a rigorous probabilistic analysis proving LSI recovers latent semantics and improves retrieval under specific generative conditions. Proposes random-projection acceleration and frames results as theoretical justification for spectral methods (e.g., collaborative filtering). (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 917 | Simrank++: Query Rewriting through Link Analysis of the Click Graph | 2008 | VLDB | 0.00015370124 |
| 3,000 | SANTOS: Relationship-based Semantic Table Union Search | 2023 | SIGMOD | 7.7462128e-05 |
| 3,485 | Using taxonomy, discriminants, and signatures for navigating in text databases | 1997 | VLDB | 7.0504959e-05 |
| 3,682 | Database-friendly Random Projections | 2001 | PODS | 6.8484436e-05 |
| 4,187 | Clustering via Matrix Powering | 2004 | PODS | 6.3754336e-05 |
| 6,325 | On the Effects of Dimensionality Reduction on High Dimensional Similarity Search | 2001 | PODS | 5.1105081e-05 |
| 8,780 | Applications of linear algebra in information retrieval and hypertext analysis | 1999 | PODS | 4.4537086e-05 |
| 12,276 | Parsimonious Linear Fingerprinting for Time Series | 2010 | VLDB | 4.1945683e-05 |
| 12,669 | Self-similarity in the web | 2001 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 72 | Combining Fuzzy Information from Multiple Systems | 1996 | PODS | 0.00058577335 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,128 | An Efficient Indexing Technique for Full-Text Database Systems | 1992 | VLDB | 0.00013794088 |
| 4,092 | Structured Annotations of Web Queries | 2010 | SIGMOD | 6.4561959e-05 |
| 8,632 | Measuring the Structural Similarity of Semistructured Documents Using Entropy | 2007 | VLDB | 4.4803734e-05 |
| 328 | An Architecture for Parallel Topic Models | 2010 | VLDB | 0.0002728514 |
| 6,325 | On the Effects of Dimensionality Reduction on High Dimensional Similarity Search | 2001 | PODS | 5.1105081e-05 |
| 11,504 | LES3: Learning-based Exact Set Similarity Search | 2021 | VLDB | 4.1945683e-05 |
| 400 | Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search | 2007 | VLDB | 0.0002427237 |
| 7,522 | Efficient and Tunable Similar Set Retrieval | 2001 | SIGMOD | 4.7180617e-05 |
| 4,988 | Incremental Maintenance of Length Normalized Indexes for Approximate String Matching | 2009 | SIGMOD | 5.783959e-05 |
| 8,780 | Applications of linear algebra in information retrieval and hypertext analysis | 1999 | PODS | 4.4537086e-05 |