LSNCCP: A Clustering Algorithm Based on the Largest Set of Not-Covered Core Points

【Authors】 XUE Yong-Sheng (1), WENG Wei (1), WEN Juan (1), WANG Jing-Bo (2), and ZHANG Yu (1)

【Affiliations】 (1) Department of Computer Science, Xiamen University, Xiamen 361005; (2) Economic School, Xiamen University, Xiamen 361005

【Abstract】 Clustering has important applications in many fields, including data mining and pattern recognition. This paper proposes a novel clustering algorithm, LSNCCP (a clustering algorithm based on the largest set of not-covered core points). Starting from a density-based definition, it examines the distances between core points and defines three relations between core points: covered, intersecting, and separate. It then finds a largest set of not-covered core points and clusters the data on the basis of that set, and it also gives a fast method for handling the lost-point problem. Because this set is only a small subset of all core points, the algorithm reduces the time that comparable algorithms spend searching for core points. The feasibility and advantages of the algorithm are demonstrated both theoretically and experimentally.
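The abstract does not give the formal definitions, so the following is only a rough sketch of the idea under stated assumptions, not the authors' algorithm. It assumes a DBSCAN-style core point (at least `min_pts` points within radius `eps`) and assumes that two core points are "covered" when their distance is at most `eps`, "intersecting" when it is at most `2 * eps`, and "separate" otherwise. The function names, the thresholds, and the greedy selection (which yields a maximal set, not necessarily the largest one) are all hypothetical choices made for illustration.

```python
import math

def neighbors(points, i, eps):
    """Indices of all points within distance eps of points[i] (itself included)."""
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

def core_points(points, eps, min_pts):
    """Assumed DBSCAN-style definition: a core point has >= min_pts neighbors."""
    return [i for i in range(len(points))
            if len(neighbors(points, i, eps)) >= min_pts]

def relation(p, q, eps):
    """Assumed thresholds for the three relations named in the abstract."""
    d = math.dist(p, q)
    if d <= eps:
        return "covered"
    if d <= 2 * eps:
        return "intersecting"
    return "separate"

def not_covered_core_set(points, eps, min_pts):
    """Greedy pass: keep a core point only if no kept core point covers it."""
    chosen = []
    for i in core_points(points, eps, min_pts):
        if all(relation(points[i], points[j], eps) != "covered" for j in chosen):
            chosen.append(i)
    return chosen
```

On two tight groups of three points each plus one distant outlier, this sketch keeps one representative core point per group, which illustrates why the selected set can be much smaller than the full set of core points.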

【Funding】 Natural Science Foundation of Fujian Province (A0310008); Key Project of the Fujian Province High-Tech Research Open Program (2003H043)
  • 【Source】 Journal of Computer Research and Development (计算机研究与发展), 2004, Issue 11
  • 【Classification No.】 TP311.13
  • 【Times cited】 8
  • 【Downloads】 154