Performance modeling of a distributed web crawler using stochastic activity networks

Mitra Nasri, Saeed Shariati, Mohammad Abdollahi Azgomi

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Academic > peer-review

4 Citations (Scopus)

Abstract

One of the basic requirements of Web mining is a crawler system, which collects information from the Web. To predict the performance, dependability, and other operational measures of a system, it is necessary to construct and evaluate a formal model of the system. We have constructed a formal model of a distributed crawler based on UbiCrawler, using stochastic activity networks (SANs). The constructed SAN model is used to evaluate some performance measures of the crawler. The throughput results are the same as the published statistics for UbiCrawler. In addition, we have been able to evaluate two other measures: communication overhead and coverage. In this paper, we discuss the architecture of the distributed crawler, then present a SAN model of the crawler and the results of its evaluation.

Original language: English
Title of host publication: Advances in Computer Science and Engineering
Subtitle of host publication: 13th International CSI Computer Conference, CSICC 2008 Kish Island, Iran, March 9-11, 2008 Revised Selected Papers
Editors: Hamid Sarbazi-Azad
Publisher: Springer
Pages: 535-542
Number of pages: 8
ISBN (Print): 3540899847, 9783540899846
Publication status: Published - 2008
Externally published: Yes
Event: 13th International Computer Society of Iran Computer Conference on Advances in Computer Science and Engineering, CSICC 2008 - Kish Island, Iran, Islamic Republic of
Duration: 9 Mar 2008 - 11 Mar 2008

Publication series

Name: Communications in Computer and Information Science
Volume: 6 CCIS
ISSN (Print): 1865-0929

Conference

Conference: 13th International Computer Society of Iran Computer Conference on Advances in Computer Science and Engineering, CSICC 2008
Country/Territory: Iran, Islamic Republic of
City: Kish Island
Period: 9/03/08 - 11/03/08

Keywords

  • performance modeling
  • stochastic activity networks
  • Web crawler
