节点文献
基于XML的智能信息检索与聚类研究
Research on XML-based web intelligent information retrieval and clustering
【摘要】 目前Web上大多是非结构化的信息,检索主要是通过基于关键词的搜索引擎或目录浏览。近来,许多组织、团体、协会在Web上通过DTD Schema定义XML(ExtensibleMarkupLanguage)文档,由于XML描述了结构化的信息,对XML文档的检索也与以往的搜索引擎不同。为此,本文设计了一个新的基于XML文档的智能信息检索原型系统XI IRC,给出了它的体系结构及功能,并对用户界面、索引机制、查询机制、检索结果概念聚类等问题进行了探讨。
【Abstract】 The current information on the World Wide Web is mainly in the unstructurized form on scattered web sites. The usual retrieval is by based searching engines or directory-based browser. Recently, an increasing number of organizations, groups and associations are adopting DTD/Schema to define XML (Extensible Markup Language) documents on the web. As information becomes more structurized, the retrieval for the XML documents on the Web is different from that by the usual searching engines. To help search these documents, a novel searching system is designed to access XML documents. The architecture and functions of the system are described, and questions as user interface, index system, search system and search results clustering are discussed.
【Key words】 XML; intelligent information retrieval; agent; searching engine; clustering; ontology;
- 【文献出处】 山东建筑工程学院学报 ,Journal of Shandong Institute of Architecture and Engineering , 编辑部邮箱 ,2004年02期
- 【分类号】TP391.3
- 【被引频次】11
- 【下载频次】229