A taxonomy of sublinear multiple keyword pattern matching algerithms

B.W. Watson, G. Zwaan

Research output: Contribution to journalArticleAcademicpeer-review

18 Citations (Scopus)
2 Downloads (Pure)

Abstract

This article presents a taxonomy of sublinear keyword pattern matching algorithms related to the Boyer-Moore algorithm [3] and the Commentz-Walter algorithm [5, 6]. The taxonomy includes, amongst others, the multiple keyword generalization of the single keyword Boyer-Moore algorithm and an algorithm by Fan and Su [9, 10]. The corresponding precomputation algorithms are presented as well. The taxonomy is based on the idea of ordering algorithms according to their essential problem and algorithm details, and deriving all algorithms from a common starting point by successively adding these details in a correctness preserving way. This way of presentation not only provides a complete correctness argument of each algorithm, but also makes very clear what algorithms have in common (the details of their nearest common ancestor) and where they differ (the details added after their nearest common ancestor). Introduction of the notion of safe shift distances proves to be essential in the derivation and classification of the algorithms. Moreover, the article provides a common derivation for and a uniform presentation of the precomputation algorithms, not yet found in the literature.
Original languageEnglish
Pages (from-to)85-118
JournalScience of Computer Programming
Volume27
Issue number2
DOIs
Publication statusPublished - 1996

Fingerprint Dive into the research topics of 'A taxonomy of sublinear multiple keyword pattern matching algerithms'. Together they form a unique fingerprint.

Cite this