节点文献

增量式频繁闭合序列挖掘算法

An incremental algorithm for mining frequent closed patterns

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 石怀东蔡铭吴洪森董金祥富浩

【Author】 SHI Huai-dong 1,CAI Ming 1,WU Hong-sen 2,DONG Jin-xiang 1,FU Hao 1 (1.College of Computer Science and Technology,Zhejiang University,Hangzhou 310027,China; 2.Zhejiang Police College,Hangzhou 310053,China)

【机构】 浙江大学计算机科学与技术学院浙江警察学院

【摘要】 在许多场合挖掘频繁闭合序列时,输入串数据库呈现实时动态增长的特点.分析Bide算法,给出并证明了闭合序列前缀中任意一个项目的后向扩展事件(BEE)项目交集随前缀的生长单调不增的定理,据此对BEE累计操作进行了优化,使其性能平均提高了4.8%.定义了闭合序列树作为频繁闭合序列的表示形式,并阐述了它的3个性质.分析发现,当新增输入串不同时包含前缀串和频繁项目时,两次连续挖掘的结果是相同的,给出了相应的定理和证明,据此实现了增量式频繁闭合序列挖掘算法BideInc.实验验证了BideInc算法的正确性,使用该算法后挖掘性能平均提高了47%.

【Abstract】 While mining frequent closed patterns(FCP),the input sequence database dynamically increases in many situations.By analyzing Bide algorithm,the theorem of backward-extension event(BEE) detection was proposed and proved.It shows that the BEE set of any prefix item is non-increasing with the extension of the prefix.Based on the theorem,the accumulation performance of the BEE set was optimized by 4.8% averagely.The FCP tree was defined to represent the final result of FCP mining and its three characteristics were demonstrated.When the frequent item and the prefix are not coexistent in the new input sequence,the results of contiguous FCP mining are equal.And the corresponding theorem was proved.The BideInc algorithm was proposed to incrementally mine FCPs.The experiments validated the algorithm,and the performance was improved by 47% averagely.

【基金】 航天科技创新基金重点资助项目(CASC04);航天支撑技术基金资助项目
  • 【文献出处】 浙江大学学报(工学版) ,Journal of Zhejiang University(Engineering Science) , 编辑部邮箱 ,2009年08期
  • 【分类号】TP311.13
  • 【被引频次】3
  • 【下载频次】144
节点文献中: 

本文链接的文献网络图示:

本文的引文网络