Some Potential Solutions
Partition-based
- use auxiliary (spatial) data structure to identify candidates
- space-partitioning methods: Grid file, k-d-b-tree, quadtree
- data-partitioning methods: R-tree, X-tree, SS-tree, TV-tree, ...
Approximation-based
- use approximating data structure to identify candidates
- signature files: VA-files
- projections: space-filling curves
Other Optimisations
- reduce I/O by reducing size of vectors (compression)
- reduce I/O by placing "similar" records together (clustering)
- reduce I/O by remembering previous pages (caching)
- reduce cpu by making D computation faster (ColorHist)
|