共查询到18条相似文献,搜索用时 62 毫秒
1.
提出了一种融合图论与密度思想的空间聚类方法——HGDSC。该方法首先借助附加约束的Delau-nay三角网来建立空间实体之间的邻接关系,然后对基于密度的聚类方法进行改进,顾及空间邻近与非空间属性相似性进行聚类。特别地,该方法只需要一个输入参数。模拟数据和实际数据验证表明,HGDSC方法能够发现任意形状和密度变化的空间簇,并且可以很好地识别噪声点。 相似文献
2.
空间和属性双重约束下的自组织空间聚类研究 总被引:2,自引:0,他引:2
形式化定义了双重聚类的聚类准则及其判定方法,提出了双重聚类的两步法求解思路和自组织双重聚类算法。通过实例验证了该算法的可行性,自组织双重聚类可以发现非空间属性的聚集、延伸等空间分布特征,可以发现任意复杂形状的聚类,并降低了人为影响。 相似文献
3.
针对Delaunay三角网空间聚类存在的不足,提出一种顾及属性空间分布不均的空间聚类方法。首先将Delaunay三角网空间位置聚类作为约束条件,采用广度优先搜索方法,以局部参数"属性变化率"作为阈值识别非空间属性相似簇的聚类过程。以城市商业中心为例,验证了该方法能够更客观地识别非空间属性相似的簇,且自适应属性阈值可以满足不同聚类需求,为城市商业中心等空间实体的提取提供了一种有效方法。 相似文献
4.
5.
非监督分类中初始聚类中心法的比较研究 总被引:1,自引:0,他引:1
遥感影像的非监督分类中,初始聚类中心的选取对分类过程和分类结果具有重要影响,好的初始聚类中心法既能提高分类的效率又能提高分类的精度。选取类间距离和类内标准差作为评价标准对现有的几种初始聚类法进行比较。结果表明,最大最小距离选心法具有较高的分类精度,但是效率较低;而基于均值标准差定心法精度较低,但效率较高。 相似文献
6.
为了使得空间聚类分析更加适应实际情况,发展了一种同时顾及空间障碍约束与空间位置邻近的空间聚类方法。该方法采用Delaunay三角网描述实体间的邻近关系,并且不依赖用户指定参数。实验验证了本方法的有效性与优越性。 相似文献
7.
8.
区域划分是依据人口和社会经济指标将行政统计单元或其他地理实体划分成若干个不同水平或类别的集合。由于大多数的人口和社会经济指标来源于面状数据-行政统计单元,常用的区域划分的空间聚类方法是基于面状数据的,本文通过分析现有面状数据的聚类算法特点和不足,进而提出一种新的算法,该方法提出将面状统计单元进行网格划分,引入基于网格密度聚类算法的思想,克服现有面状聚类的诸多缺点,打破行政区划的限制,更好地发现潜在信息。 相似文献
9.
10.
11.
Spatial clustering is widely used in many fields such as WSN (Wireless Sensor Networks), web clustering, remote sensing and so on for discovery groups and to identify interesting distributions in the underlying database. By discussing the relationships between the optimal clustering and the initial seeds, a clustering validity index and the principle of seeking initial seeds were proposed, and on this principle we recommend an initial seed-seeking strategy: SSPG (Single-Shortest-Path Graph). With SSPG strategy used in clustering algorithms, we find that the result of clustering is optimized with more probability. At the end of the paper, according to the combinational theory of optimization, a method is proposed to obtain optimal reference k value of cluster number, and is proven to be efficient. 相似文献
12.
Spatial clustering is widely used in many fields such as WSN (Wireless Sensor Networks), web clustering, remote sensing and so on for discovery groups and to identify interesting distributions in the underlying database. By discussing the relationships between the optimal clustering and the initial seeds, a clustering validity index and the principle of seeking initial seeds were proposed, and on this principle we recommend an initial seed-seeking strategy: SSPG (Single-Shortest-Path Graph). With SSPG strategy used in clustering algorithms, we find that the result of clustering is optimized with more probability. At the end of the paper, according to the combinational theory of optimization, a method is proposed to obtain optimal reference k value of cluster number, and is proven to be efficient. 相似文献
13.
14.
空间数据模糊聚类的有效性(英文) 总被引:1,自引:0,他引:1
The validity measurement of fuzzy clustering is a key problem. If clustering is formed, it needs a kind of machine to verify its validity. To make mining more accountable, comprehensible and with a usable spatial pattern, it is necessary to first detect whether the data set has a clustered structure or not before clustering. This paper discusses a detection method for clustered patterns and a fuzzy clustering algorithm, and studies the validity function of the result produced by fuzzy clustering based on two aspects, which reflect the uncertainty of classification during fuzzy partition and spatial location features of spatial data, and proposes a new validity function of fuzzy clustering for spatial data. The experimental result indicates that the new validity function can accurately measure the validity of the results of fuzzy clustering. Especially, for the result of fuzzy clustering of spatial data, it is robust and its classification result is better when compared to other indices. 相似文献
15.
16.
ZHOU Jiaogen GUAN Jihong LI Pingxiang 《地球空间信息科学学报》2007,10(2):137-144
Spatial objects have two types of attributes: geometrical attributes and non-geometrical attributes, which belong to two different attribute domains (geometrical and non-geometrical domains). Although geometrically scattered in a geometrical domain, spatial objects may be similar to each other in a non-geometrical domain. Most existing clustering algorithms group spatial datasets into different compact regions in a geometrical domain without considering the aspect of a non-geometrical domain. However, many application scenarios require clustering results in which a cluster has not only high proximity in a geometrical domain, but also high similarity in a non-geometrical domain. This means constraints are imposed on the clustering goal from both geometrical and non-geometrical domains simultaneously. Such a clustering problem is called dual clustering. As distributed clustering applications become more and more popular, it is necessary to tackle the dual clustering problem in distributed databases. The DCAD algorithm is proposed to solve this problem. DCAD consists of two levels of clus- tering: local clustering and global clustering. First, clustering is conducted at each local site with a local clustering algorithm, and the features of local clusters are extracted. Second, local features from each site are sent to a central site where global clustering is obtained based on those features. Experiments on both artificial and real spatial datasets show that DCAD is effective and efficient. 相似文献
17.
Spatial objects have two types of attributes: geometrical attributes and non-geometrical attributes, which belong to two different attribute domains (geometrical and non-geometrical domains). Although geometrically scattered in a geometrical domain, spatial objects may be similar to each other in a non-geometrical domain. Most existing clustering algorithms group spatial datasets into different compact regions in a geometrical domain without considering the aspect of a non-geometrical domain. However, many application scenarios require clustering results in which a cluster has not only high proximity in a geometrical domain, but also high similarity in a non-geometrical domain. This means constraints are imposed on the clustering goal from both geometrical and non-geometrical domains simultaneously. Such a clustering problem is called dual clustering. As distributed clustering applications become more and more popular, it is necessary to tackle the dual clustering problem in distributed databases. The DCAD algorithm is proposed to solve this problem. DCAD consists of two levels of clustering: local clustering and global clustering. First, clustering is conducted at each local site with a local clustering algorithm, and the features of local clusters are extracted. Second, local features from each site are sent to a central site where global clustering is obtained based on those features. Experiments on both artificial and real spatial datasets show that DCAD is effective and efficient. 相似文献