Samenvatting
Distributed crawling is able to overcome Important limitations of the traditional single-sourced web crawling systems. However, the optimal benefit of distributed crawling is usually limited to the sites hosting the crawlers, the rest of the URLs are by large randomly distributed to the various crawlers. In this work, we propose a location-aware method, called IPMicra, that utilizes an IP address hierarchy, and allows crawling of links in a near optimal location aware manner. Our proposal outperforms earlier distributed crawling schemes by requiring one order of magnitude less time for crawling of the same set of sites.
| Originele taal-2 | Engels |
|---|---|
| Pagina's | 694-699 |
| Aantal pagina's | 6 |
| Status | Gepubliceerd - 1 dec. 2004 |
| Extern gepubliceerd | Ja |
| Evenement | International Conference on Internet Computing, IC'04 - Las Vegas, NV, Verenigde Staten van Amerika Duur: 21 jun. 2004 → 24 jun. 2004 |
Congres
| Congres | International Conference on Internet Computing, IC'04 |
|---|---|
| Land/Regio | Verenigde Staten van Amerika |
| Stad | Las Vegas, NV |
| Periode | 21/06/04 → 24/06/04 |
Vingerafdruk
Duik in de onderzoeksthema's van 'IPMicra: an IP-address based location aware distributed web crawler'. Samen vormen ze een unieke vingerafdruk.Citeer dit
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver