Distributed location aware web crawling

Odysseas Papapetrou, George Samaras

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

4 Citations (Scopus)

Abstract

Distributed crawling has shown that it can overcome important limitations of today's crawling paradigm. However, the optimal benefits of this approach are usually limited to the sites hosting the crawler. In this work, we propose a location-aware method, called IPMicra, that utilizes an IP address hierarchy and allows links to be crawled in a near-optimal, location-aware manner.

Original language: English
Title of host publication: Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004
Publisher: Association for Computing Machinery, Inc
Pages: 468-469
Number of pages: 2
ISBN (Electronic): 1581139128, 9781581139129
DOIs: https://doi.org/10.1145/1013367.1013529
Publication status: Published - 19 May 2004
Externally published: Yes
Event: 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004 - New York, United States
Duration: 19 May 2004 to 21 May 2004

Conference

Conference: 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004
Country: United States
City: New York
Period: 19/05/04 to 21/05/04

Keywords

  • Distributed web crawling
  • Location aware web crawling

Cite this

Papapetrou, O., & Samaras, G. (2004). Distributed location aware web crawling. In Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004 (pp. 468-469). Association for Computing Machinery, Inc. https://doi.org/10.1145/1013367.1013529
Papapetrou, Odysseas ; Samaras, George. / Distributed location aware web crawling. Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004. Association for Computing Machinery, Inc, 2004. pp. 468-469
@inproceedings{bd5fa2b11a8f4ee9889dff31dc6d26fe,
title = "Distributed location aware web crawling",
abstract = "Distributed crawling has shown that it can overcome important limitations of today's crawling paradigm. However, the optimal benefits of this approach are usually limited to the sites hosting the crawler. In this work, we propose a location-aware method, called IPMicra, that utilizes an IP address hierarchy and allows links to be crawled in a near-optimal, location-aware manner.",
keywords = "Distributed web crawling, Location aware web crawling",
author = "Odysseas Papapetrou and George Samaras",
year = "2004",
month = "5",
day = "19",
doi = "10.1145/1013367.1013529",
language = "English",
pages = "468--469",
booktitle = "Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004",
publisher = "Association for Computing Machinery, Inc",
address = "United States",

}

Papapetrou, O & Samaras, G 2004, Distributed location aware web crawling. in Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004. Association for Computing Machinery, Inc, pp. 468-469, 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004, New York, United States, 19/05/04. https://doi.org/10.1145/1013367.1013529

Distributed location aware web crawling. / Papapetrou, Odysseas; Samaras, George.

Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004. Association for Computing Machinery, Inc, 2004. p. 468-469.

TY - GEN
T1 - Distributed location aware web crawling
AU - Papapetrou, Odysseas
AU - Samaras, George
PY - 2004/5/19
Y1 - 2004/5/19
N2 - Distributed crawling has shown that it can overcome important limitations of today's crawling paradigm. However, the optimal benefits of this approach are usually limited to the sites hosting the crawler. In this work, we propose a location-aware method, called IPMicra, that utilizes an IP address hierarchy and allows links to be crawled in a near-optimal, location-aware manner.
AB - Distributed crawling has shown that it can overcome important limitations of today's crawling paradigm. However, the optimal benefits of this approach are usually limited to the sites hosting the crawler. In this work, we propose a location-aware method, called IPMicra, that utilizes an IP address hierarchy and allows links to be crawled in a near-optimal, location-aware manner.
KW - Distributed web crawling
KW - Location aware web crawling
UR - http://www.scopus.com/inward/record.url?scp=66349112068&partnerID=8YFLogxK
U2 - 10.1145/1013367.1013529
DO - 10.1145/1013367.1013529
M3 - Conference contribution
AN - SCOPUS:66349112068
SP - 468
EP - 469
BT - Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004
PB - Association for Computing Machinery, Inc
ER -

Papapetrou O, Samaras G. Distributed location aware web crawling. In Proceedings of the 13th International World Wide Web Conference on Alternate Track, Papers and Posters, WWW Alt. 2004. Association for Computing Machinery, Inc. 2004. p. 468-469 https://doi.org/10.1145/1013367.1013529