节点文献
WWW上的信息发现与搜索引擎技术
INFORMATION DISCOVERY AND SEARCH ENGINE FOR THE WORLD-WIDE WEB
【摘要】 随着Internet在我国逐步得到普遍应用以及WWW上中文信息量的不断增长,迫切需要研制适合我国国情的中英文Web索引和检索服务系统。WWW的信息发现和搜索引擎又称为robot,负责搜索和获取指定范围内的有关数据。本文对Web搜索引擎的工作原理和关键技术进行了讨论和分析,并介绍了我们在研制中英文Web索引和检索服务器方面所做的工作,包括系统总体结构和汉语分词技术等。
【Abstract】 With the rapid expansion of Internet in China and the continuous increase of the amount of Chinese information on WWW, it is desired to develop Chinese-English Web search and indexing service systems. A WWW information discovery and search engine is called a robot which is in charge of searching and acquiring related data in a given range. This article discusses and analyzes the mechanism and techniques of robot. A prototype of a Chinese-English Web search engine is introduced, including its overall structure and the techniques used for Chinese word segmentation.
【Key words】 WWW Information Discovery and Acquisition Search engine Robot;
- 【文献出处】 小型微型计算机系统 ,MINI-MICRO SYSTEMS , 编辑部邮箱 ,1998年06期
- 【分类号】TP393,
- 【被引频次】99
- 【下载频次】370