IPMicra: an IP-address based location aware distributed web crawler

Research output: Contribution to conference, Paper, Academic

Abstract

Distributed crawling can overcome important limitations of traditional single-sourced web crawling systems. However, the optimal benefit of distributed crawling is usually limited to the sites hosting the crawlers; the rest of the URLs are by and large randomly distributed among the various crawlers. In this work, we propose a location-aware method, called IPMicra, that utilizes an IP address hierarchy and allows links to be crawled in a near-optimal, location-aware manner. Our proposal outperforms earlier distributed crawling schemes, requiring an order of magnitude less time to crawl the same set of sites.
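
The abstract does not spell out how the IP address hierarchy is used, so the following is only a minimal sketch of one way a URL's resolved IP address could be routed to a nearby crawler: pick the crawler registered for the most specific subnet that contains the address. The subnet table, the crawler names, and the `pick_crawler` function are all hypothetical illustrations and are not taken from the paper.

```python
import ipaddress

# Hypothetical table: each subnet is mapped to the crawler assumed to be
# closest to it. The addresses are reserved documentation ranges.
SUBNET_TO_CRAWLER = {
    ipaddress.ip_network("0.0.0.0/0"): "crawler-default",
    ipaddress.ip_network("192.0.2.0/24"): "crawler-a",
    ipaddress.ip_network("198.51.100.0/24"): "crawler-b",
    ipaddress.ip_network("198.51.100.128/25"): "crawler-c",
}


def pick_crawler(ip: str) -> str:
    """Return the crawler for the most specific subnet containing `ip`."""
    addr = ipaddress.ip_address(ip)
    # Collect every subnet in the hierarchy that contains the address.
    matches = [net for net in SUBNET_TO_CRAWLER if addr in net]
    if not matches:
        raise LookupError(f"no subnet registered for {ip}")
    # The longest prefix is the deepest (most specific) node in the hierarchy.
    best = max(matches, key=lambda net: net.prefixlen)
    return SUBNET_TO_CRAWLER[best]
```

In this sketch, an address that falls in a nested subnet goes to the crawler of the innermost subnet, and an address outside every specific subnet falls back to the catch-all entry. A real system would also need a way to build and update the table, which the abstract does not describe.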

Original language: English
Pages: 694-699
Number of pages: 6
Status: Published - 1 Dec 2004
Externally published: Yes
Event: International Conference on Internet Computing, IC'04 - Las Vegas, NV, United States
Duration: 21 Jun 2004 - 24 Jun 2004

Conference

Conference: International Conference on Internet Computing, IC'04
Country/Region: United States
City: Las Vegas, NV
Period: 21/06/04 - 24/06/04
