Database Paper Browser


Web Page Language Identification Based on URLs

Summary: A URL-only classifier that identifies a web page's language without fetching its content. Across five languages (en, fr, de, es, it), it reaches F-measures up to 0.96 and recall up to 0.95, outperforming ccTLD baselines and enabling quota-aware crawling. (summarized by gpt-5-nano on Feb 09 2026)
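For orientation, the kind of ccTLD baseline the summary compares against might look like the minimal sketch below. It is not the paper's classifier: the function name `cctld_baseline` and the `CCTLD_TO_LANG` mapping are assumptions made for illustration, and the sketch only inspects the last label of the host name.

```python
from urllib.parse import urlparse

# Hypothetical mapping from country-code TLDs to the five languages
# named in the summary; this table is an assumption, not from the paper.
CCTLD_TO_LANG = {
    "fr": "fr",
    "de": "de",
    "es": "es",
    "it": "it",
    "uk": "en",
}


def cctld_baseline(url, default=None):
    """Guess a page's language from the URL's top-level domain only."""
    # urlparse only fills in the host when the URL carries a scheme.
    host = urlparse(url).hostname or ""
    # The last dot-separated label of the host is the top-level domain.
    tld = host.rsplit(".", 1)[-1].lower()
    return CCTLD_TO_LANG.get(tld, default)


print(cctld_baseline("http://www.example.de/seite"))
print(cctld_baseline("http://example.com/fr/accueil"))
```

A baseline of this shape returns no answer for generic domains such as .com, even when other parts of the URL hint at the language, which is one plausible reason a classifier that uses the whole URL could do better.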

Paper ID: 9667
Venue: VLDB
Year: 2008
Pagerank: -
Overall Rank: 13,570 | 5.60%
DOI: -

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 0 of 0 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.


Semantically Similar Papers