Efficient nearest-neighbour searches using weighted euclidean metrics

Ruth Kurniawati, Jesse Jin, John Shepherd

Sixteenth British National Database Conference (BNCOD16), Cardiff, Wales, July 1998

(Compressed Postscript ... 354KB)


Nearest-neighbour search is an important operation in multimedia databases and many other applications. The similarity measurement usually involves weighting on some of the dimensions. Furthermore, the metric is subjective and weighting could vary. The existing tree structured spatial access methods cannot support this requirement directly. In an arbitrary distance metric, the nearest-neighbor disk will have an arbitrary shape and hence the intersection between the disk and the bounding rectangle of tree's nodes cannot be easily calculated. The commonly used method to do nearest-neighbour search using distances other than unweighted euclidean is not favourable for the weighted euclidean distance.

We show that we can do the search more efficiently by calculating a bounding envelope of the nearest-neighbour query region directly. Our solution is based on the observation that the weighted euclidean distance actually defines a transformation between spaces. The algorithms to calculate the bounding circle and bounding box of the query region directly from the weight matrix are provided. The time complexity of the algorithms are polynomial to the number of the dimensions and they are extensible directly to higher dimensional spaces. Using this approach, changes in the weight matrix can be handled easily with nominal computational overhead without having to rebuild the tree. We also provide an analytical and empirical results regarding the performance of our approach in terms of the number of disk pages touched.

Keys: nearest-neighbour search, weighted Euclidean distance, coordinate transformations, R-trees, spatial access methods


Recent Publications ... John Shepherd ... CSE ... UNSW