TCS-TR-A-07-27

Date: Fri May 18 23:06:20 2007

Title: A Theoretical Study on Variable Ordering of ZBDDs for Representing Frequent Itemsets

Authors: Shin-ichi Minato

Contact:

  • First name: Shin-ichi
  • Last name: Minato
  • Address: Division of Computer Science, Hokkaido University North 14, West 9, Sapporo 060-0814, Japan.
  • Email: minato@ist.hokudai.ac.jp

Abstract. Recently, an efficient method of database analysis using Zero-suppressed Binary Decision Diagrams (ZBDDs) has been proposed. BDDs are a graph-based representation of Boolean functions, now widely used in system design and verification. Here we focus on ZBDDs, a special type of BDDs, which are suitable for handling large-scale combinatorial itemsets in frequent itemset mining. In general, it is well-known that the size of ZBDDs greatly depends on variable ordering; however, in the specific cases of applying ZBDDs to data mining, the effect of variable ordering has not been studied well. In this paper, we present a theoretical study on ZBDD variable ordering for representing frequent itemsets. We show two typical databases we found out, where the ZBDD sizes are exponentially sensitive to the variable ordering. We also show that there is a case where the ZBDD size must be exponential in any variable ordering. Our theoretical results are helpful for developing a good heuristic method of variable ordering.


©Copyright 2007 Authors