Abstract
Computing frequent itemsets is one of the most prominent problems in data mining. We study the following related problem, called FREQSAT, in depth: given some itemset-interval pairs, does there exist a database such that for every pair the frequency of the itemset falls into the interval? This problem is shown to be -complete. The problem is then further extended to include arbitrary Boolean expressions over items and conditional frequency expressions in the form of association rules. We also show that, unless equals , the related function problem–find the best interval for an itemset under some frequency constraints–cannot be approximated efficiently. Furthermore, it is shown that FREQSAT is recursively axiomatizable, but that there cannot exist an axiomatization of finite arity.
Original language | English |
---|---|
Pages (from-to) | 84-111 |
Journal | Theoretical Computer Science |
Volume | 394 |
Issue number | 1-2 |
DOIs | |
Publication status | Published - 2008 |