Text Similarity Computing Based on Context Framework Model
【Abstract】 This paper introduces a new semantic model for text formalization, the Context Framework Model (CFM). The context framework is a three-dimensional semantic description that abstracts the content of a text into three facets: domain (static category), situation (dynamic description), and background (commendatory or derogatory tone, reference, and so on). On the basis of the context framework, a text similarity algorithm is designed and implemented. Working at the concept level, the algorithm takes full account of how the domain of the text and the semantic roles of objects affect similarity. It focuses on ambiguity, polysemy, and concept combination in text, as well as commendatory and derogatory tendencies in language, and it quantifies the degree of semantic similarity between texts. The algorithm has been applied in a text filtering system to compare user filtering requirements with the texts to be filtered, and it has given fairly satisfactory results in practical use.
【Key words】 text similarity; Context Framework Model (CFM); domain; situation; background; semantic frame of text; commendatory; derogatory
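The abstract does not give the paper's actual formulas, so the following is only a minimal illustrative sketch of how a three-facet frame (domain, situation, background) might be compared. Every name here (`ContextFrame`, `frame_similarity`), the use of Jaccard overlap, the scalar polarity score, and the default weights are assumptions made for illustration; none of them is the authors' algorithm.

```python
from dataclasses import dataclass
from typing import Dict, Set, Tuple


@dataclass
class ContextFrame:
    """Hypothetical three-facet text representation (not the paper's data structure)."""
    domain: Set[str]                 # static categories the text belongs to
    situation: Dict[str, Set[str]]   # semantic role -> concepts filling that role
    polarity: float                  # background tone: -1.0 (derogatory) to +1.0 (commendatory)


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Set overlap in [0, 1]; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def situation_similarity(s1: Dict[str, Set[str]], s2: Dict[str, Set[str]]) -> float:
    """Average concept overlap per semantic role, over every role seen in either frame."""
    roles = set(s1) | set(s2)
    if not roles:
        return 1.0
    total = sum(jaccard(s1.get(r, set()), s2.get(r, set())) for r in roles)
    return total / len(roles)


def polarity_similarity(p1: float, p2: float) -> float:
    """1.0 for identical tone, 0.0 for fully opposite tone."""
    return 1.0 - abs(p1 - p2) / 2.0


def frame_similarity(
    f1: ContextFrame,
    f2: ContextFrame,
    weights: Tuple[float, float, float] = (0.3, 0.5, 0.2),  # assumed weights, not from the paper
) -> float:
    """Weighted combination of the three facet similarities."""
    w_domain, w_situation, w_background = weights
    return (
        w_domain * jaccard(f1.domain, f2.domain)
        + w_situation * situation_similarity(f1.situation, f2.situation)
        + w_background * polarity_similarity(f1.polarity, f2.polarity)
    )


# Usage: a user filtering requirement compared with a candidate text.
query = ContextFrame({"finance"}, {"agent": {"bank"}, "action": {"raise"}}, 0.5)
text = ContextFrame({"finance"}, {"agent": {"bank"}, "action": {"cut"}}, -0.5)
score = frame_similarity(query, text)
```

In a filtering setting such as the one the abstract describes, a score like this would be compared against a threshold to decide whether a text matches the user's requirement. Keeping the background facet as a separate term is what lets two texts about the same topic but with opposite tone score lower than two texts that agree in tone.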
- 【Source】 Computer Engineering and Applications (计算机工程与应用), 2004, Issue 16
- 【Classification No.】 TP391.1
- 【Times cited】 103
- 【Downloads】 857