Performance modeling of a distributed web crawler using stochastic activity networks

Mitra Nasri, Saeed Shariati, Mohammad Abdollahi Azgomi

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

4 Citaten (Scopus)

Samenvatting

One of the basic requirements of Web mining is a crawler system, which collects the information from the Web. To predict the performance, dependability and other operational measures of a system, it is required to construct and evaluate a formal model of the system. We have constructed a formal model for a distributed crawler, which is based on UbiCrawler, using stochastic activity networks (SANs). The constructed SAN model is used to evaluate some performance measures of the crawler. The results of the evaluation of throughput are same as the published statistics of UbiCrawler. In addition, we have been able to evaluate two other measures that are communication overhead and coverage. In this paper, we will discuss the architecture of the distributed crawler. Then, we will present a SAN model of the crawler and the results of its evaluation.

Originele taal-2Engels
TitelAdvances in Computer Science and Engineering
Subtitel13th International CSI Computer Conference, CSICC 2008 Kish Island, Iran, March 9-11, 2008 Revised Selected Papers
RedacteurenHamid Sarbazi-Azad
UitgeverijSpringer
Pagina's535-542
Aantal pagina's8
ISBN van geprinte versie3540899847, 9783540899846
DOI's
StatusGepubliceerd - 2008
Extern gepubliceerdJa
Evenement13th International Computer Society of Iran Computer Conference on Advances in Computer Science and Engineering, CSICC 2008 - Kish Island, Iran
Duur: 9 mrt. 200811 mrt. 2008

Publicatie series

NaamCommunications in Computer and Information Science
Volume6 CCIS
ISSN van geprinte versie1865-0929

Congres

Congres13th International Computer Society of Iran Computer Conference on Advances in Computer Science and Engineering, CSICC 2008
Land/RegioIran
StadKish Island
Periode9/03/0811/03/08

Vingerafdruk

Duik in de onderzoeksthema's van 'Performance modeling of a distributed web crawler using stochastic activity networks'. Samen vormen ze een unieke vingerafdruk.

Citeer dit