A Study on Ontology-based Intelligent Information Retrieval System
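【Author】 Wang Cungang (王存刚);
【Advisor】 Yao Wenlin (姚文琳);
【Author information】 Ocean University of China, Computer Application Technology, 2006, Master's thesis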
【Abstract】 The Web holds a wealth of information and has become an important channel through which people obtain it. Yet finding the information one actually wants is not easy: Web pages are unstructured or semi-structured, hyperlinks are free-form and unordered, and Web content is massive, diverse, and constantly changing. Traditional information retrieval technology relies mainly on keyword matching, has little capacity for semantic inference, and offers users no semantic guidance when they formulate queries, so retrieval systems return irrelevant results and miss relevant ones. Improving the quality of Web information retrieval has therefore become an important topic in information retrieval (IR), data mining (DM), and knowledge management (KM). The fundamental way to improve retrieval quality is to turn unordered data into ordered knowledge, so that computers can understand the meaning of Web information and thereby perform semantic retrieval. To this end, Tim Berners-Lee, the inventor of the Web, proposed the Semantic Web in 1998: an extension of the current Web in which information is given well-defined meaning that computers can understand, better enabling computers and people to work in cooperation. Ontology provides a mechanism for representing the semantics of Web information and is a key technology for realizing the Semantic Web. This thesis analyzes the problems and shortcomings of traditional Web information retrieval; studies in depth the concepts, modeling primitives, description languages, construction methods, and construction tools of Ontology, with particular attention to the semantic expressiveness of OWL; and on that basis proposes a framework for an Ontology-based intelligent information retrieval system, describing its functions and implementation mechanisms. It also examines the key technologies such a system involves and proposes solutions that give theoretical support to the development of a prototype. These key technologies are Ontology construction, Ontology storage, and Ontology-based information retrieval strategies. Finally, the thesis designs and implements PaperSearch, a prototype Ontology-based intelligent literature retrieval system, for which a domain Ontology of computer science and a literature Ontology were built. PaperSearch provides discipline-oriented retrieval services with flexible and varied search modes, offers users semantic guidance, has relatively strong inference capability, and achieves knowledge-based semantic retrieval. Experiments show that the system improves the quality and efficiency of information retrieval, thereby validating the proposed theory.
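The contrast the abstract draws between keyword matching and Ontology-based semantic retrieval can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy subclass hierarchy, the document set, and the function names are hypothetical and are not taken from PaperSearch, whose implementation the abstract does not describe. The sketch expands a query concept to all of its transitive subclasses before matching, which is one simple form of the inference that plain keyword matching lacks.

```python
from collections import defaultdict

# Hypothetical toy ontology: child concept -> parent concept (subclass-of).
SUBCLASS_OF = {
    "owl": "ontology language",
    "rdf schema": "ontology language",
    "ontology language": "knowledge representation",
}

# Hypothetical document collection: document id -> indexed concept terms.
DOCS = {
    "d1": {"owl", "reasoning"},
    "d2": {"rdf schema"},
    "d3": {"ontology language"},
    "d4": {"keyword matching"},
}


def descendants(concept):
    """Return the concept plus every concept that is transitively a subclass of it."""
    children = defaultdict(set)
    for child, parent in SUBCLASS_OF.items():
        children[parent].add(child)
    found, stack = {concept}, [concept]
    while stack:
        for child in children[stack.pop()]:
            if child not in found:
                found.add(child)
                stack.append(child)
    return found


def keyword_search(query):
    """Baseline: a document matches only if it contains the exact query term."""
    return sorted(doc for doc, terms in DOCS.items() if query in terms)


def semantic_search(query):
    """Match documents indexed under the query concept or any of its subclasses."""
    expanded = descendants(query)
    return sorted(doc for doc, terms in DOCS.items() if terms & expanded)


if __name__ == "__main__":
    print(keyword_search("ontology language"))
    print(semantic_search("ontology language"))
```

In this toy setting the keyword baseline finds only the document indexed under the literal term, while the expanded query also finds documents indexed under narrower concepts. A real system would load the hierarchy from an OWL Ontology rather than a hard-coded dictionary.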
【Key words】 Ontology; Semantic Web; Semantic Retrieval; Intelligent Information Retrieval;
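- 【Online publication submitter】 Ocean University of China 【Online publication issue】 2007, Issue 02
- 【Classification code】 TP391.3
- 【Citation count】 19
- 【Download count】 480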