节点文献

一种基于语义信息计算XML文档相似度的新方法

A Semantics-Based Method for Calculating Similarity of XML Documents

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 雷庆吴扬扬

【机构】 华侨大学计算机科学系

【摘要】 <正>1引言XML(eXtensible Markup Language,可扩展标记语言)的提出给基于Web的应用软件赋予强大的功能和灵活性,正越来越广泛地为开发者和用户所使用。

【Abstract】 As a standard for data exchange on the Web,XML has been used to various domains.Accurate quantitative determination of similarity between two paragraphs of XML documents provides an important basis for a variety of applications of XML document mining and processing.In this paper,we propose a new method for calculating the similarity between two paragraphs of XML documents by taking account of the semantics of XML tags.We design and implement an experiment system,which firstly parses XML documents automatically,thenanalyze the semantic information of XML documents,finally calculate the similarity between two paragraphs of documents. Experiments show that the method can reflect the similar degree between two paragraphs of XML documents accurately.

【Key words】 XMLData miningSemantic similar degree
  • 【会议录名称】 第二十一届中国数据库学术会议论文集(技术报告篇)
  • 【会议名称】第二十一届中国数据库学术会议
  • 【会议时间】2004-10-14
  • 【会议地点】中国福建厦门
  • 【分类号】TP391.1
  • 【主办单位】中国计算机学会数据库专业委员会
节点文献中: 

本文链接的文献网络图示:

本文的引文网络