Availability and accuracy of distributed web crawlers: A model-based evaluation

Mitra Nasri, Saeed Shariati, Mohsen Sharifi

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

3 Citaten (Scopus)

Samenvatting

Distributed Web crawlers are extensively used for Web mining nowadays, but their accuracy, dependability and other operational measures have not been fully studied. Distributed Web crawlers are costly and require careful selection of configuration parameters. It is important to have some estimation about the performance, dependability and accuracy of a Web crawler. This paper presents a model-based evaluation of the accuracy and availability of a distributed Web crawler whose architecture is based on UbiCrawler. Stochastic activity networks are used for modelling the crawler. Accuracy and availability of the Web crawler are formally defined, and the effects of environmental failure rates on crawling nodes and on the availability of the whole system are discussed.

Originele taal-2Engels
TitelProceedings - EMS 2008, European Modelling Symposium, 2nd UKSim European Symposium on Computer Modelling and Simulation
UitgeverijInstitute of Electrical and Electronics Engineers
Pagina's453-458
Aantal pagina's6
ISBN van geprinte versie9780769533254
DOI's
StatusGepubliceerd - 2008
Extern gepubliceerdJa
EvenementEMS 2008, European Modelling Symposium, 2nd UKSim European Symposium on Computer Modelling and Simulation - Liverpool, Verenigd Koninkrijk
Duur: 8 sep. 200810 sep. 2008

Congres

CongresEMS 2008, European Modelling Symposium, 2nd UKSim European Symposium on Computer Modelling and Simulation
Land/RegioVerenigd Koninkrijk
StadLiverpool
Periode8/09/0810/09/08

Vingerafdruk

Duik in de onderzoeksthema's van 'Availability and accuracy of distributed web crawlers: A model-based evaluation'. Samen vormen ze een unieke vingerafdruk.

Citeer dit