Spatial Indexing Part 1: Why a Spatial Index?
Hi Folks,
If you've seen one of my recent spatial talks, you've seen me walk through the basics of how our indexing works. I'm going to do roughly the same thing here over a series of blog posts, except that I'm (eventually) going to go into a few more details. We'll start with some basics, and work down to what we actually do.
All spatial databases (that I'm aware of) use the same basic strategy: when presented with a spatial predicate---e.g., and intersection---they work by splitting the predicate into two parts. These two parts are called the primary filter and the secondary filter. To a first a proximation, the secondary filter is the original predicate, and the primary filter is some other operator that has three properties:
- It may produce false positives---objects may pass the filter that do not satisfy the predicate---but no false negatives.
- It doesn't produce too many false positives---if it does it won't be effective.
- It is fast.
To make this concrete, assume I have a database of all of the roads in the United States. If I want those roads that intersect Madison, Wisconsin, a (poor) primary filter may quickly throw out all of the roads that do not lie within the state of Wisconsin. There are certainly roads in Wisconsin that do not intersect Madison. That these pass the primary filter is fine; it's critical however that no roads intersecting Madison are removed by the primary filter. We can now do the intersection operation on those roads that remain and be guaranteed that our final answer has all of the right results.
Why go to all of this trouble? First, because the secondary filter---the actual intersection operator---is slow. If we can eliminate as many calls as possible to this complicated procedure we'll come out ahead. Second, because we'd actually like to achieve sublinear behavior, and we can't do that if we have to operate on each row of our database.
If you haven't guessed, the link to indexing is that, quite simply, the spatial index is the primary filter.
Cheers,
-Isaac
Next up: A simple spatial indexing scheme.
Comments
Anonymous
November 27, 2007
Hi Folks, Let's continue with the discussion of spatial indexing form last time . If you already knowAnonymous
December 05, 2007
Hi Folks, Last time we talked about a simple primary filter. (See previous installments: 1 , 2 .) WeAnonymous
February 05, 2008
Hi Folks, Well, I never did complete my series of posts describing our indexing scheme. I guess I oughtAnonymous
February 05, 2008
Hi Folks, Well, I never did complete my series of posts describing our indexing scheme. I guess I oughtAnonymous
February 10, 2008
PASSJにでも書いてろって内容だけど...わんくまでやるお!Anonymous
February 10, 2008
PASSJにでも書いてろって内容だけど...わんくまでやるお!Anonymous
May 28, 2009
Hi Folks, This post contains no new information; it’s just a rollup of links to spatial indexing posts