TCS-TR-A-06-22

Date: Sat Nov 11 16:00:09 2006

Title: Symmetric Item Set Mining Method Using ZBDDs and Application to Biological Data

Authors: Shin-ichi Minato and Kimihito Ito

Contact:

  • First name: Shin-ichi
  • Last name: Minato
  • Address: Division of Computer Science, Hokkaido University, North 14 West 9, Sapporo, 060-0814 Japan.
  • Email: minato@ist.hokudai.ac.jp

Abstract. In this paper, we present a method of finding symmetric items in a combinatorial item set database. The techniques for finding symmetric variables in Boolean functions have been studied for long time in the area of VLSI logic design, and the BDD (Binary Decision Diagram) -based methods are presented to solve such a problem. Recently, we have developed an efficient method for handling databases using ZBDDs (Zero-suppressed BDDs), a particular type of BDDs. In our ZBDD-based data structure, the symmetric item sets can be found efficiently as well as for Boolean functions. We implemented the program of symmetric item set mining, and applied it to actual biological data on the amino acid sequences of influenza viruses. We found a number of symmetric items from the database, some of which indicate interesting relationships in the amino acid mutation patterns. The result shows that our method is helpful for extracting hidden interesting information in real-life databases.


©Copyright 2006 Authors