TY - GEN
T1 - SOM-based R∗-tree for similarity retrieval
AU - Oh, Kun Seok
AU - Feng, Yaokai
AU - Kaneko, K.
AU - Makinouchi, A.
AU - Bae, Sang Hyun
N1 - Publisher Copyright:
© 2001 IEEE.
PY - 2001
Y1 - 2001
N2 - Feature-based similarity retrieval has become an important research issue in multimedia database systems. The features of multimedia data are useful for discriminating between multimedia objects (e.g., documents, images, video, music score, etc.). For example, images are represented by their color histograms, texture vectors, and shape descriptors. A feature vector is a vector that represents a set of features, and are usually high-dimensional data. The performance of conventional multidimensional data structures (e.g., R-tree family K-D-B tree, grid file, TV-tree) tends to deteriorate as the number of dimensions of feature vectors increases. The R∗-tree is the most successful variant of the R-tree. We propose a SOM-based R∗-tree as a new indexing method for high-dimensional feature vectors. The SOM-based R∗-tree combines SOM and R∗-tree to achieve search performance more scalable to high dimensionalities. Self-organizing maps (SOMs) provide mapping from high-dimensional feature vectors onto a two-dimensional space. The mapping preserves the topology of the feature vectors. The map is called a topological feature map, and preserves the mutual relationships (similarity) in the feature spaces of input data, clustering mutually similar feature vectors in neighboring nodes. We experimentally compare the retrieval time cost of a SOM-based R∗-tree with that of an SOM and an R∗-tree using color feature vectors extracted from 40,000 images.
AB - Feature-based similarity retrieval has become an important research issue in multimedia database systems. The features of multimedia data are useful for discriminating between multimedia objects (e.g., documents, images, video, music score, etc.). For example, images are represented by their color histograms, texture vectors, and shape descriptors. A feature vector is a vector that represents a set of features, and are usually high-dimensional data. The performance of conventional multidimensional data structures (e.g., R-tree family K-D-B tree, grid file, TV-tree) tends to deteriorate as the number of dimensions of feature vectors increases. The R∗-tree is the most successful variant of the R-tree. We propose a SOM-based R∗-tree as a new indexing method for high-dimensional feature vectors. The SOM-based R∗-tree combines SOM and R∗-tree to achieve search performance more scalable to high dimensionalities. Self-organizing maps (SOMs) provide mapping from high-dimensional feature vectors onto a two-dimensional space. The mapping preserves the topology of the feature vectors. The map is called a topological feature map, and preserves the mutual relationships (similarity) in the feature spaces of input data, clustering mutually similar feature vectors in neighboring nodes. We experimentally compare the retrieval time cost of a SOM-based R∗-tree with that of an SOM and an R∗-tree using color feature vectors extracted from 40,000 images.
UR - http://www.scopus.com/inward/record.url?scp=84963860755&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84963860755&partnerID=8YFLogxK
U2 - 10.1109/DASFAA.2001.916377
DO - 10.1109/DASFAA.2001.916377
M3 - Conference contribution
AN - SCOPUS:84963860755
T3 - Proceedings - 7th International Conference on Database Systems for Advanced Applications, DASFAA 2001
SP - 182
EP - 189
BT - Proceedings - 7th International Conference on Database Systems for Advanced Applications, DASFAA 2001
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Database Systems for Advanced Applications, DASFAA 2001
Y2 - 18 April 2001 through 21 April 2001
ER -