Association Computation for Information Access*
(invited lecture for DS 2003)

Author: Akihiko Takano

Affiliation: National Institute of Informatics, Software Research Division, Tokyo, Japan

Abstract. GETA (Generic Engine for Transposable Association) is a software that provides efficient generic computation for association. It enables the quantitative analysis of various proposed methods based on association, such as measuring similarity among documents or words. Scalable implementation of GETA can handle large corpora of twenty million documents, and provides the implementation basis for the effective information access of next generation. DualNAVI is an information retrieval system which is a successful example to show the power and the flexibility of GETA-based computation for association. It provides the users with rich interaction both in document space and in word space. Its dual view interface always returns the retrieved results in two views: a list of titles for document space and ``Topic Word Graph'' for word space. They are tightly coupled by their cross-reference relation, and inspires the users with further interactions. The two-stage approach in the associative search, which is the key to its efficiency, also facilitates the content-based correlation among databases. In this paper we describe the basic features of GETA and DualNAVI.


*The full version of this paper is published in the Proceedings of the 6th International Conference on Discovery Science, Lecture Notes in Artificial Intelligence Vol. 2843


©Copyright 2003 SPRINGER