节点文献

融合网格密度的聚类中心初始化方案

A Cell Density-Enabled Schema for Initializing Cluster Centers

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 牛琨张舒博陈俊亮

【Author】 NIU Kun1,ZHANG Shu-bo2,CHEN Jun-liang1 (1.School of Computer Science and Technology,Beijing University of Posts and Telecommunications,Beijing 100876,China;2.Department of Strategy Research,China Telecom Beijing Research Institute,Beijing 100035,China)

【机构】 北京邮电大学计算机科学与技术学院中国电信北京研究院决策研究部北京邮电大学计算机科学与技术学院 北京100876北京100035北京100876

【摘要】 提出了一种采用密度指针的聚类中心初始化方法——density pointer(DP)算法.DP算法以网格单元的几何中心为对称中心,连接该中心与网格单元各顶点,以此对称分割传统的类矩形网格单元,形成超三角形子空间;进而根据各个超三角形子空间与邻居单元相邻的超三角形子空间的密度差异确定密度指针的方向,并根据密度指针计算出每个密集网格单元的聚集因子;最后将具有较大局部聚集因子的网格单元族的重心作为初始聚类中心.在公开数据集和人工数据集上的实验结果表明,DP算法能快速高效地找到接近于真实聚类中心的数据点作为初始聚类中心.针对算法的效率实验表明,DP算法的时间开销与数据集实例数、维度以及网格单元数量均呈一阶线性关系.

【Abstract】 A new algorithm using density pointer is proposed to initialize cluster centers. The density pointer(DP) algorithm takes the geometric centers of grid cells as symmetrical centers.With the interconnections between these centers and the vertices of grid cells,DP partitions traditional rectangular-like grid cells into hyper triangle-like subspaces.The density differences between hypertriangle subspaces and the corresponding subspaces of their neighborhoods are considered to define density pointers.After that,DP will detect density pointers to calculate the aggregation factors of dense cells and then takes the gravity centers of the cells with larger local aggregation factors as initial cluster centers.Experiments on both public and real datasets show that DP is helpful to find cluster centers near to real centers quickly and effectively.Moreover,the running time of DP is linear with respect to the number of instances,the number of grid cells and the dimensions.

【基金】 国家自然科学基金项目(60432010);国家“973计划”项目(2007CB307100)
  • 【文献出处】 北京邮电大学学报 ,Journal of Beijing University of Posts and Telecommunications , 编辑部邮箱 ,2007年02期
  • 【分类号】TP311.13
  • 【被引频次】43
  • 【下载频次】360
节点文献中: 

本文链接的文献网络图示:

本文的引文网络