Association Computation for Information Access*
(invited lecture for DS 2003)
Author: Akihiko Takano
Affiliation: National Institute of Informatics,
Software Research Division, Tokyo, Japan
Abstract.
GETA (Generic Engine for Transposable Association)
is a software that provides efficient generic computation for association.
It enables the quantitative analysis of various proposed methods
based on association, such as measuring similarity among documents or
words. Scalable implementation of GETA can handle large corpora of twenty
million documents, and provides the implementation basis
for the effective information access of next generation.
DualNAVI is an information retrieval system which is a successful
example to show the power and the flexibility of GETA-based computation
for association.
It provides the users with rich interaction
both in document space and in word space.
Its dual view interface always returns the retrieved results in two views:
a list of titles for document space and
``Topic Word Graph'' for word space.
They are tightly coupled by their cross-reference relation,
and inspires the users with further interactions.
The two-stage approach in the associative search,
which is the key to its efficiency,
also facilitates the content-based correlation among databases.
In this paper we describe the basic features of GETA and DualNAVI.
*The full version of this paper is
published in the Proceedings of the 6th International Conference
on Discovery Science, Lecture Notes in Artificial Intelligence Vol. 2843
©Copyright 2003 SPRINGER
|