An accurate selectivity estimation method for window queries and an implementation thereof |
| |
Authors: | Changxiu Cheng, Jing Yang, Xiaomei Song, Shanli Yang, Lijun Wang |
| |
Affiliation: | 1. Academy of Disaster Reduction and Emergency Management, Beijing Normal University, Beijing 100875, China; 2. State Key Laboratory of Resources and Environmental Information Systems, Institute of Geographic Science and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; chengcx@bnu.edu.cn; 4. State Key Laboratory of Resources and Environmental Information Systems, Institute of Geographic Science and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; 5. China Internet Network Information Center, Beijing 100101, China |
| |
Abstract: | Spatial selectivity estimation is crucial for a query optimizer to choose the cheapest execution plan for a given query. This article proposes an accurate spatial selectivity estimation method based on cumulative density (CD) histograms that can handle an arbitrary spatial query window. In this method, the selectivity is estimated using the original logic of the CD histogram once the four corner values of the query window have been accurately interpolated on the continuous surface of the elevation histogram. To interpolate a corner point, we first identify the cells that can affect the value of point (x, y) in the CD histogram. These cells fall into two classes: those lying entirely within the range from (0, 0) to (x, y), and those only partially overlapping that range. The values of the former class can be used directly, whereas the value of each cell in the latter class is revised according to the number of vertices in the cell and the ratio of the cell's area covered by the range from (0, 0) to (x, y). This revision makes the estimation more accurate. The CD histograms and the estimation method have been implemented in INGRES. Experimental results show that the method can accurately estimate the selectivity of arbitrary query windows and can help the optimizer choose a cheaper query plan. |
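The corner interpolation described in the abstract can be illustrated with a minimal Python sketch. It rests on several assumptions that are not taken from the article: a single uniform grid of per-cell point counts (`counts`), fixed cell dimensions (`cell_w`, `cell_h`), and a plain inclusion-exclusion over the four interpolated corner values. The function names are hypothetical, and the article's actual CD histogram structure and its INGRES implementation may differ.

```python
def cumulative_at(counts, cell_w, cell_h, x, y):
    """Estimate how many points lie in the range from (0, 0) to (x, y).

    Assumption: counts[i][j] is the number of points in the cell whose
    lower-left corner is (i * cell_w, j * cell_h).
    """
    total = 0.0
    for i, column in enumerate(counts):
        for j, n in enumerate(column):
            # Extent of the overlap between this cell and the range, per axis.
            dx = min(x, (i + 1) * cell_w) - i * cell_w
            dy = min(y, (j + 1) * cell_h) - j * cell_h
            if dx <= 0 or dy <= 0:
                continue  # the cell lies outside the range
            # The ratio is 1.0 for a cell fully inside the range and
            # below 1.0 for a cell that only partially overlaps it.
            ratio = (dx * dy) / (cell_w * cell_h)
            total += n * ratio
    return total


def estimate_window_count(counts, cell_w, cell_h, x1, y1, x2, y2):
    """Combine the four interpolated corner values of the window
    (x1, y1)-(x2, y2) by inclusion-exclusion (a simplifying assumption)."""
    def cd(x, y):
        return cumulative_at(counts, cell_w, cell_h, x, y)

    return cd(x2, y2) - cd(x1, y2) - cd(x2, y1) + cd(x1, y1)


# Hypothetical 2 x 2 grid of 10 x 10 cells: 4 points in the cell at the
# origin and 8 points in the diagonally opposite cell.
counts = [[4, 0], [0, 8]]
estimate = estimate_window_count(counts, 10, 10, 5, 5, 15, 15)
```

Dividing the estimated count by the total number of points would give a selectivity. Weighting partially covered cells by area assumes that points are spread uniformly within each cell.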
| |
Keywords: | cumulative density (CD) histogram; selectivity estimation; window queries; spatial database; spatial query optimization |
|
|